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EFFECTIVE QUALITY CONTROL PROGRAM FOR THE INDUSTRIAL CONTROL LABORATORY 


Frank W. Kroll, Jr. 
Esso Standard Oil Company 


Esso Standard's Bayway Refinery has pioneered in the effective appli- 
cation of statistical quality control to improve the management of the 
control laboratory. While this application is only the first phase of a 
broad program, the reader will find a well integrated laboratory quality 
control program described in this paper. The net result of mking the 
"quality control function" an integral part of the normal line-super- 
visory responsibility has been improved testing reliability for many 
standard, well established chemical and physical test methods. 


Introduction 


An oil refinery produces a large number of products and performs 
intermediate and finished-product inspections by the thousands every day. 
To do this inspection job, many tests are employed by staffs of trained 
laboratory technicians. The measurement of product quality has become 
increasingly important in the post-war years along with an increasingly 
competitive market for petroleum-refined products. Iack of reliable 
product quality measurements, in this kind of situation, may be respon- 
sible for excessive "product-quality giveaway” and improvement in the 
accuracy and precision of test methods leads to real dollar savings to a 
refinery. It was primarily for this reason that, two years ago, organi- 
zation and direction was given to the use of statistical quality control 
as & management tool to improve the reliability of laboratory inspection 
results. Alternate ways of accomplishing our objective were considered. 
Multi-testing (analyzing enough replicate tests to reduce the standard 


deviation of the reported result), development of new test methods, 
increased emphasis on laboratory technician training, revision and 
simplification of tests and the rewriting of test procedures were all 
tried; nome of these approaches accomplished the job of improving and 
maintaining work-quality effectively. 


It is the purpose of this paper to describe the overall laboratory 
quality control program and to point to some of the important details 
which have a bearing on the success of such a@ program. 


Management Philosophy of the Program 


The single most important factor which prevents a quality control 
program from becoming a one-shot affair (usually in the form of a 
committee activity) is the continuing interest and patronage of top- 
management. To accomplish this, management must be “inspired” by a 
purposeful and practical philosophy. For the industrial inspection 
laboratory, this philosophy mst, of necessity, involve all personnel 
concerened with the production of the “laboratory commodity - the 
inspection result". It must involve, in particular, those who dispense 
this commodity to the customers - the line-supervisory foreman. It is 
axiomatic that contimued acceptance of the commodity by customers 
depends on the continued high-quality of the daily routine inspection 
result. With these concepts in mind, it was decided to fortify the 
laboratory foremen with a continuous system of measuring the quality 
(reliability) of routine inspection result - in short, to give the 
foreman the laboratory control chart as an efficient, low cost “extra 
man" to monitor the work-quality of the personnel and equipment turning 
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out these results. The important feature of this philosophy is that, 
contrary to some of the quality control programs described in recent 
literature, the laboratory foreman did not become the object of control, 
but rather became responsible for it. Under these circumstances, the 
overall “quality consciousness” of all laboratory personnel was improved 
and maintained. 


Foreman Training Course 





Since the responsibility of laboratory work-quality rests with the 
foremen, it was decided to concentrate training efforts on this group. 
A curriculum was designed to instruct these men in the theory, mechanics 
and necessary calculations to set up and maintain laboratory control 
charts; a copy of the program outline may be found in Figure I. 


The formal instruction consisted of four 1-1/2 hour sessions. This 
was handled by two University professors actively engaged in the field 
of statistical quality control. The lecture presentations were pitched 
at the practical level, supplying only the minimm of statistical theory. 
The sessions were spaced two weeks apart to allow practice of the techni- 
ques between classes; most of the examples used during class were drawn 
from current laboratory data. An intensified follow-up program was 
carried out after campletion of the formal instruction; this was assigned 
to a technical coordinator. It consisted of technical assistance to 
compile data, set initial control chart limit lines and provide a 
system of cammnication to laboratory management on the progress of all 
active control charts. It is believed that this intensified follow-up 
was in large part responsible for the contimued interest and improved 
effectiveness of the foremen in translating the subjects of the lectures 
into practical applications. 


Organization and Communication 





It is axiomatic that the effectiveness of a quality control program 
must be maximized by an efficient system of commnication fram top to 
bottom of the laboratory organization as well as between the laboratory 
and its customers. Figure II illustrates the organization which was 
set up to do this job. Below are described the various activities which 
have been effective in keeping the program vigorous and continually 
expanding: 


1. Preparation of a monthly summary table on the status of all 
active control charts by the general foremen of the laboratory 
control groups to the laboratory head. A copy of this is also 
sent to the technical coordinator. The assignment to prepare 
this report has clearly left the responsibility for work- 
quality in the hands of the foreman and has provided the 
incentive to keep this vigilance continuous. 


2. The technical coordinator calls a monthly meeting of the general 
foremen, technical supervisors and laboratory head to discuss 
mutual problems and progress made. Special problems concerning 
specific tests are brought to the attention of the foremen by 
the technical coordinator. This meeting has served to tie 
together the individual programs in each control group as well 
as to keep management directly informed as to the progress of 
the overall program. 


3. Each laboratory control group places the cwrent mximm 
allowable testing error on the daily summary inspection 
report sent to the process operating personnel; these 
statistics are supplied by the technical coordinator. 
Specifically, each inspection reported to process personnel 
is accompanied by a statistic defining the maximm allowable 
difference between two independent tests on the same sample. 
This difference is the standard deviation miltiplied by 
2 2. This commmication is considered to be very important, 
since (1) it relates testing reliability to a specific result 
received by process personnel in non-statistical language 
and (2) it provides an on-the-spot basis for judging the 
wisdom of “another sample” if the first result reported is 
not a “desirable” one. Our records show this action has 
reduced the mmber of second and third samples requested by 
process personnel to “check the first figure". 


4. The technical coordinator prepares a complete quarterly 
tabulation of current standard deviations of test methods 
covered with control charts. This letter has been effective 
as a guide to technical personnel in making decisions as to 
the validity of product quality differences reflected by 
plant-test data. In addition, it provides reliable work- 
quality indexes of our laboratories which may be compared 
with those of other company laboratories. 


Procedure of Activating Control Charts 





A general, step-wise procedure has been followed in setting a 
control chart into routine operation. The following describes the 
sequence of events that has been followed: 


1. Assignment to activate a control chart for a specific test 
method to a foreman by the laboratory head. 


2. A system of accumilating data is set up by the foreman. 
This is designed so that control chart limit lines can be 
set in about 2 - 4 weeks. 


3. Comtrol chart limit lines are calculated and drawn. The 
chart is displayed at the site of the equipment used by the 
laboratory technician. This step usually involves the 
services of the technical coordinator of all chart activity. 


4, Subsequent analysis for control chart purposes are scheduled 
and the data are plotted as accwmilated by the foreman. It 
is this activity by the foreman that has been responsible 
for improvement in test reliability. With the chart dis- 
played on site, a simple observation of the chart by the 
foreman for out-of-control points has made rapid corrective 
action possible. Since points appear on most charts at 
least once every two days, changes in work-quality due to 
different technicians have permitted judicious selection of 
technicians requiring further training. Enforcement of 
adherence to standard procedures and use of proper equipment 
have also resulted in precision improvement. 

















5. Control limit lines are recalculated once a month. Changes 
in level of precision are relayed to the coordinator and 
reported once a month to laboratory management. 


Pigure III illustrates the typical history of a control chart. For 
purposes of illustration, average and control limit lines have been 
drawn for all the data (covering about 5 months). Figure III illustrates 
one of the two types of charts employed - the “standard sample" chart. 
Both this type and the “range” chart are employed by the laboratories: 


(1) The standard sample control chart mikes use of two simple graphs- 
one for a measure of the changes in level of results with time, 
the X plot, and a second for the measure of the spread of 
results, the R chart. The abcissa for both plots is time, the 
ordinate a quality characteristic; for refinery products it is 
normally a specification (sulfur content, flash point, viscosity, 
etc.). A practical example of this type of chart is illustrated 
in Pigure III. The test method in question had been in use for a 
mumber of years and was thought to be very reliable. As the data 
from blind and random submission of a standard sample began to 
show a stable level, but undesirable ranges (from 3/29 to 6/4/54), 
it was concluded that a change in technician was desirable; an 
examination of equipment and procedure was also instituted. The 
result of this short-term study by the foreman showed the equip- 
ment to be the important factor; in this case a change fram a 
porcelain to a platimm crucible to carry out our ashing pro- 
cedure was instituted. The out of control points No. 11 for 
ranges and No. 12 for levels were attributed to poor equipment 
and an umwise change in technicians respectively. With a change 
in equipment and the original technician, subsequent ranges 
(points 15-24) reflected an improvement in generally lower and 
more stable ranges; the level of results during this time re- 

i A a to that reflected by points 1 through 6 (3/29 to 
5/13/54). 


The above experience is illustrative of the typical results of 
employing the control chart technique. Examination of chart 
information normally reveals that for a period of one to three 
months a standard well-established test method seems to behave 
as expected; enough time must be allowed for all possible 
variables to exert their influence on the overall variability. 
The test method used in the example continued to reflect ranges 
higher than were originally thought to be normal before any 
control chart examination was undertaken. Currently, intensive 
procedure with a modern instrumental technique. 


(2) The second t of chart, illustrated in Figure III-A, is a plot 
of Ranges (R) only. This is used to monitor methods employed to 
analyze products which are not stable enough to keep on hand in 
the form of standard samples. In the example (Migure III-A), a 
reduction in range (R) with time reflects an improvement in 
precision. Control limit lines were reset to conform to a 
lower average range toward the latter part of the time interval 

(2 months) covered by the chart. 








Accomplishments 





Over a two-year period, the quality control program described has 
made it possible to place 75 - 80% of the routine inspection work load 
in statistical control. This includes practically all specification 
quality evaluations carried out by the laboratories. It involves over 
100 test methods, many of which have been oved and all of which have 
arrived at a state of “statistical control” (consistently reliable at a 
specified level of precision). It is emphasized that this was accam- 
plished as a result of the efforts of laboratory technicians and foremen 
as a part of their daily work routine. 


Figure IV lists typical improvements that have been experienced with 
about 90% of the test methods monitored for quality with laboratory 
control charts. Normally, a reduction to about one-half of the before- 
chart standard deviation is experienced; some of the tests improved to 
as mich as one-fifth of the before-chart standard deviation. 


The program has worked very successfully from the standpoint of 
calculated monetary returns due to improved test reliability. Perhaps 
@& more important advantage, although intangible, has been a reaffirmed 
interest of laboratory personnel, foremen as well as technicians, in the 
significance and quality of their efforts. 


Most test methods monitored with control charts arrive at a stable 
level of precision. This condition represents the best practical level 
of precision that can be expected to result from routine use of control 
charts. It also represents a history which serves as a reliable start- 
ing point for technical analytical development effort, if the level of 
precision is still not satisfactory. Some tests show no improvement 
after a control chart has been in use for several months; it is our 
experience that such tests require a major change such as an entire new 
procedure, if improvement is economically justifiable. These experiences 
clearly indicate that the control chart approach has accomplished two 
things: (1) it has afforded a very effective and inexpensive method of 
improving laboratory testing variability for many tests at one time and 
(2) it has served as a guide in deciding where concentrated analytical 
development effort should be applied. It is only after these two areas 
of activity have been exhausted that milti-testing is considered as a 
means of improving the reliability of results reparted by the control 
laboratory. 


Other Benefits 





Among the many decisions a laboratory foreman must make every day, 
it is in his domain to decide whether a sampled product is on or off 
specification; more generally, to decide what the quality level of the 
product is on the basis of a single test result. Daily decisions are 
made by these men whether to (a) report the original result, (b) request 
another sample or (c) retest the same sample and report the first or 
second result or average these results. It is in this area that a 
strong statistical quality control program offers the foremen the “basis 
for decision” in their daily discussions with process personnel concern- 
ing the "true™ quality of the product inspected. It offers him valid 
arguments to counteract coercion by process personnel to (a) retest a 
sample when actually not warranted, te) accept a second, third and at 
times as many as 6 - 8 samples in order to obtain a “desirable” test 
result. 
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One of the benefits resulting from the use of control charts on 
test methods lies in the fact that practically every test so monitored 
soon exhibits “statistical control"; this is considered an improvement 
even though the data do not exhibit a reduction in standard deviation. 
Under these conditions, every single test result is the most probable 
candidate for the average product quality measured; in addition, definite 
statements can be made as to the maximm allowable difference between 
two independent test results on the same sample. It was with these 
considerations in mind that a "Decision - Function Policy” for laboratory 
foremen was formated. Figure VI is a copy of the official policy. It 
is obvious that without reliable work-quality indexes (standard devia- 
tions) of test methods such ae policy could not be formated. It has 
been very well received by foremen, and has saved time and money in 
making decisions concerning intermediate as well as finished product 
quality. The enforcement of this policy has magnified the fact that the 
quality control function is a vital part of the laboratory foremen's 
responsibility. 


Economic Evaluation of Accomplishments 





Multi-testing has been suggested as a means of improving routine 
testing variability, especially for those test methods which have a high 
economic value. The cost of such a program is defined as the mmber (N) 
of replicate tests that mst be averaged to reduce the variability 
(standard deviation) of test methods to a desired level. The mmber of 
replicates that must be run is calculated as the square of the ratio of 
the current standard deviation to the desired standard deviation. This 
mumber (N) is calculated from the fundamental relationship 


G=¢ YN 
it se ag 
cz 


' 
Where G = standard deviation of a single determination 


Gz = ctantert Govtetion of tte average of N 
replicate determinations 


On this basis, four replicate analyses mst be performed and averaged to 
be equivalent to a reduction to one-half of the before-control chart 
standard deviations; Figure IV illustrates the reduction of the before- 
chart standard deviations which resulted from the control chart approach. 
It is on this basis that the economic accomplishments of the program has 
been calculated. A sample calculation may be found in Figure V. Since 


ed in the example, the incentive to use statistical quality control runs 
the thousands of dollars per day and into the millions of dollars 
per year. Of course, it will be clear that a milti-testing program on a 
large scale would never have been undertaken. Nevertheless, the mgni- 
tude of the high cost of mlti-testing campared to the use of control 


charts clearly illustrates the wisdom of going this route; in fact, any 
alternate route would present a prohibitive cost. 


Summary 


The quality control program described in this paper has been a suc- 
cessful venture for two years. The laboratory control chart has pro- 
vided realistic measurement of prevailing tegting variabilities for 
many standard well-established test methods. In many instances, spe- 
cific and removable causes have been detected and eliminated or at least 
minimized after isolation and correction. In addition, the quality 
control program has served to maintain levels of performance and ranges 
of variation. Most important, the quality control system described has 
provided a continuous, representative check on overall laboratory per- 
formance, and therefore reflects management acceptance of its respon- 
sibility for maintaining standard and stable conditions. As a result, 
the Imge inspection job of our refinery today is being run like an 
efficient business. The commodity we sell is data. Our aim is more 
reliable data for the customers we serve. Quality control has been an 
effective means for ioing this job. 





FIGURE I 


TRAINING PROGRAM OUTLINE 





First Session: Fundamental Concepts 





a. demonstration with models and gadgets 

b. definition of "quality" and"statistics" 
implied in the title 

c. patterns of "normal" variation 


Second Session: Types of Variation 





a. frequency distribution 

b. demonstration of kind of data used 

¢. concept of "population" and sampling for an 
estimate of variation 


Third Session: The Control Chart for Variables 





a. application in the laboratory 
1. standard sample 
2. for duplicate analyses (one example for 
each type) 
b. applications in the plant 
1. demonstration with case history 


Fourth Session: The Control Chart (continued) 





a. for moving average and range 
b. discussion of mechanics for maintaining charts - 
illustrated with advanced case histories 





FIGURE II 


ORGANIZATION FOR EFFECTIVE QUALITY CONTROL 

















General 
General Foreman Foreman General Foreman 

Monthly Technical Monthly Summary 
Progress Report <j Coordinator -———® to Laboratory 
Meeting Management 
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Work-Quality Indexes Quarterly Summary Report to 

on Daily Inspection all Customers of Laboratory 

Reports to Process Service 
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FIGURE IV 


TYPICAL REDUCTION OF STANDARD DEVIATION 
AFTER USE OF CONTROL CHARTS 
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FIGURE V 


SAMPLE CALCUIATION OF DOLLAR SAVINGS 
DUE TO REDUCTION OF STANDARD DEVIATIONS 





Test Method: Vapor Pressure 
Unit Cost: X Dollars * 
Avg. No. tests run per day: 13 
Std. Dev. before control chart: 1.0 
Std. Dev. after control chart: 0.33 
$/day 
Present cost to handle workload: 13X 
Cost of control chart analyses: xX 
Total cost of present system: 14x 
Cost of multi-testing: (9) (13) (Xx) 117x 
Dollars/day saved: 103X 


*Unit costs vary from 2 to 20 dollars 
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FIGURE VI 


DECISION - POLICY 





FOR 


REPORTED LABORATORY INSPECTION 





Introduction 


The procedure outlined below is designed to serve as a guide to 
laboratory foremen in making decisions concerning the question of 
whether or not to report a test result on a sample. Its purpose is to 
enable foremen to make this decision on a maximm of two samples. 


Procedure 


Case I: When current control chart data shows test method employed 
to be in statistical control. 


1. Report the result as is. 


Case II: When current control chart data shows test method employed to 
be out of control. 


1. Re-test independently on same sample; if the difference 
between these two results is within maximm allowable 
error, report the average of the results; if the dif- 
ference is greater than the maximm allowable error, 
discontinue analysing routine samples until the method 
is known to be in control. 








2. If not enough of original sample is available for second 
analysis, request a second sample of sufficient size to 
make two determinations. If the difference between the 
first analysis of the second sample and the original re- 
sult is within the maximum allowable error, report the 
average of these two results. 


If the difference between original result and first analy- 
sis of second sample is greater than the maximm allow- 
able error, analyze the second sample a second time. If 
the difference between the two results on second sample 

is within the maximm allowable error, report the average 
of these results; if the difference is greater than the 
maximum allowable error, discontinue analyzing routine 
samples until the method is known to be in control. 








Case III: When there is no control chart in service for the method 
employed. 


1. When the test method employed is believed to be unreliable, 


obtain two independent test results on each sample. Report 
the average of these results. 
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When the range (differences) between at least 15 duplicate 
tests, exhibits statistical control, analyze samples only 
once and report that result. 


y- 
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MANAGING THE QUALITY CONTROL FUNCTION 
Robert G. Mitchell 
International Latex Corporation 
INTRODUCTION 


In order to discuss a subject as broad as the managing of anything, 
certain restraints must be drawn around it so that the discussion may 
have some fruitful meaning to the participants. I would, therefore, 
prefer to generally express some thoughts on managing the function in 
which all of us are apart rather than trying to give anyone the opinion 
that here is a complete rundown on the entire topic. Moreover, I will 
confine these remarks to our overall company situation rather than use 
any outside criteria. 


It is my opinion that before we can proceed with anything else, we 
must understand the company atmosphere around us and our place in it. 
As a staff organization within Manufacturing, we are responsible for 
serving a line group on all quality matters, We are dedicated to assist- 
ing line management through close teamwork to successfully manufacture a 
product economically that conforms to raw material and product specifi- 
cations drawn up by Research and accepted by the line in Manufacturing. 


If we agree that this is the situation in which we find ourselves 
in broad terms, then it is the "how" of assisting line management 
effectively that we really want most to direct our attention for this 
discussion on "managing". 


Each of us is a manager of all or part of the Quality Control func- 
tion in his particular plant. Each of us has problems in common regard- 
less of plant location or product differences, The art of management, 
while it must fit the conditions of the plant and products concerned, 
still remains a basic field unto itself. We can manage well, we can 
manage poorly, or we can just struggle along; but to really succeed in 
our work in Quality Control, we have only the obvious alternative! 


DEFINING THE FUNCTION 


Each of us has been given and understands the objectives of Quality 
Control in his company. We all operate under a similar scope of function 
These objectives and scope of activity provide us basic guides for 
operating. They are prerequisites for managing, for without them we 
would have no purpose or road to follow, 


Beyond these bare minimum needs of direction, we must develop a 
means for fully conducting those activities for which we are responsible. 
This must come from "planning" the role to be played and pot to be played 
by Quality Control in its operation. Such statements do much to clarify 
with others as well as ourselves the Quality Control job more specifically 
than the broad terms expressed in the scope of the function. 


Basic concepts must be established to be used as a kind of motivat- 
ing theme. Among these concepts would be these; 


1. Use facts, not opinions 
2. let the facts supply conviction 


17 








3. Concentrate on management controllable defects rather 
than operator controllable defects 

4. Work on the major few defects rather than the many 
minor defects. 

5. Dollars saved are generally more important than 
percent reductions 

6. Maintain a balance of effort between chronic and 
sporadic defect studies, 


USE OF MANAGEMENT TOOLS 


All these items mentioned thus far - objectives, scope of function, 
role to be played, basic concepts - are identifiable as guideposts that 
mark the way for us. They provide us with the rules of the game. But 
now we must play according to them to achieve results and we mst look 
for tools now to make job success a daily feature instead of a grand 
occasion, 


The need for a solid, tight organization is one of the fundamental 
of such tools. Whether it involves a single engineer or a group of 
engineers, the problem of establishing people in clearly defined posi- 
tions effectively is the same. The organisation once established 
requires review often for it mst be altered to fit constantly changing 
conditions. Flexible, well defined organization of the function is a 
mst, 


Administrative tools are also fundamental for they steer each 
manager along the way. The creation of annual plans, quarterly reviews 
of past performance and future plans, expense budgets, cost reports and 
other aids like them provide guidance for an ambitious, balanced program 
of Quality Control activity. Lack of balance is one of the greatest 
hindrances to overall accomplishment there is! 


The creation of procedures and methods of handling the specifics of 
the function are most necessary for efficient operation. I believe in 
the philosophy that it it hasn't been written, it hasn't been well 
thought out. While this may not always be possible for all things, it 
is a fundamental need for most. Misunderstanding instructions is a 
common occurrence, yet could be greatly minimized by a concisely well 
written confirmation. Misunderstandings delay results and confuse; 
procedures expedite results and gain uniformity of action and inter- 
pretation. 


With our function as new to our company as it is, we have much 
training of all people to do so that we may satisfactorily do the 
Quality Control job. Verbal training alone and disjointed will never 
yield entirely satisfactory performance. Training programs for key work 
must be developed so that instruction is uniform and requires proof of 
reasonable understanding by the recipient. 


We use statistical tools in our work. These are the tools that tend 
to mystify all but ourselves - and even we experience distress with some 
of them. Use of statistics must be made wherever they will substantially 
assist in getting the true facts economically and enable us to offer 
better recommendations to the line. We should give these tools full 
expression yet be sure we speak to others in the laymen's language so 
that their benefits will not be stifled by the lack of technical under- 


standing on the part of line management, 
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The development of goals requires also the development of yardsticks 
and scoreboards with which to evaluate achievement. Whether these be 
quality performance, vendor conditions, dollar savings or any other form 
of study, we must create means of measuring performance. How else can 
work effort and accomplishments be reliably identified? Providing these 
adequately will give added guidance to both the function itself and to 
line management as well. 


Above all these tools, we must keep in mind that uniform raw 
materiels, reduced defectives and better quality going to the customer 
must result in dollar savings at the same time. No proposal will be well 
received by the line unless it has a cost consideration behind it. 
Dedication to saving dollars while improving quality conditions is our 
only acceptable and correct approach to serving our plants well. 


THE SPIRIT OF MANAGING 


So far, I have confined my remarks to those basic requirements and 
tools that can be and have been said over and over again. To me they 
are the surface story of managing. Without any one of them, the function 
will not realize its full potential. I can recommend them as vital for 
the capable managing of our function. 


But the key to real success in managing as I see it, lies in a more 
personal vein. I see the key in the "spirit" of the individuals 
participating in Quality Control work from the manager spreading through- 
out the organization. I say that because successful managing requires 
so much more than just the rules as they are so often expressed. 


Each of us must provide personal leadership and inspire confidence 
in all people with whom we are associated, Our enthusiasm must be 
catching. The spirit of achieving hard won success as a team together 
with other line and staff people must prevail. There is no place in any 
of our plants for a pole to carry the Quality Control flag higher than 
those of the other plant groups. 


We must sell our services to the line and convince them that they 
need our brand of service. If we are to be used correctly and 
effectively, we cannot demand but mst prove by our achievements that we 
are the best at our kind of work. 


We must be self starters] We don't wait for assignments, but dig 
in and do something about every situation as it comes up that is in our 
line. And we don't ignore it if it just doesn't fit us toa "T™. We 
discuss it with othersand only bow out if some other group is better 
equipped to handle it. In other words, we don't have and cannot have a 
wall around us that keeps us confined with blinders, Each plant has a 
team; Quality Control must be an active member! 


We serve the plant manager well by assisting Production in every 
way possible. The planning of a balanced program of activity is always 
a joint effort with Production and other staff groups that are concerned. 
Revisions in basic plans likewise are always done in consultation with 
the other groups so that smooth functioning of the team unit remains 
through discussion and communication, 


Good managers are not "fence sitters" either. No one expects rash 
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decisions made without due consideration and fact finding. But the 
managing of any function requires a positive, confident approach. We 
cannot serve the line adequately by continued reservations to issues. The 
line takes the risk in making decisions and they depend often on the 
recommendations of staff. Consequently, it is our job to suggest at 
least one proposal and preferrably several alternatives where possible 
to the line by the deadline time for a given decision to be reached. In 
carrying out our functions, we should always submit a recommendation 
along with any problem that arises. The plant manager, Production and 
other service groups must know where we stand! 


It is appropriate that we should also give attention to the problem 
that arises seldomly, we hope, but none the less does happen. That is 
the occasion when the line chooses to take action contrary to our 
suggestions. Under these conditions, a good manager really shows up! 


Surely we must believe our proposals to be correct when we give 
them. Moreover, we must continue presenting our point of view in a 
variety of expressions of facts until we have gained our point when we 
know our proposal to be right. But let us be certain all the while that 
we listen to and respect the other points of view; that we fully appre- 
clate the prerogatives of the line to make the decisions for which they 
are responsible; and, above all, support the decisions finally made with 
all of our capabilities! Such action on the staff manager's part is one 
of the most difficult to learn and one of the most necessary personal 
ingredients to have for consideration as a good manager, 


Just one other thought on this situation. Let us always respect 
the line management and their positions of pressure. If we respect 
ourselves and our profession, we must likewise show in full return a 
genuine respect for the line. When things don't go just the way we have 
recommended, it is far better to first look at ourselves and what we have 
proposed again before casting an eye in the other direction} 


THE OTHER SIDE TO MANAGEMENT 


With all of these thoughts on managing that I have expressed in the 
form of basic concepts, tools at our disposal, general basic rules of 
operation and the "spirit" so necessary for effective achievement, there 
is one last but far from least consideration that I would like to leave 
with you. 


Ours is a job of technical staff work. Quality Control is a new 
function operating in new plants with new men in our company with line 
men who, for the most part, are unfamiliar with the techniques of the 
function. These line men are eagerly looking to us to be of real help 
to them and we likewise want to be of help as quickly as we can. Such 
an atmosphere has afforded us every opportunity to put into practice 
many systems and practices so far with great potential for success. 


But much is at stake. The line is looking to us for competent, 
successful activities both technically and generally. They look to us 
to manage well in all things. 


This entire discussion has primarily been slanted to the technical 
side for a purpose, yet there is another side often taken for granted 
that never can be ignored as "in the bag". This other side consists of 
such things as reporting to work on time, a full 8 hours work for each 
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person in the group, breaks that do not exceed the plant policy, reports 
distributed on time, good human relations practices, etc. 


There are many items to be included, but I should like to stress one 
in particular. That one is the art of supervision. A plan or system 
can be installed in the plant that can save money and provide better 
control than ever before, yet fail because it wasn't supervised well. All 
of us can readily agree that this can be true, but so often we shrug it 
off as insignificant because “we don't let that happen to us". Other 
people have that problem, not us] 


My word here is one of caution, We are a new function. We know it 
is justified as a part of the manufacturing team. We know what great 
potential good can come from its practice, 


Let us not then ever feel complacent about this so-called “other 
side". Much of the good in the technical side will go astray in the 
eyes of the line and the people in the plant if these other side neces- 
sities are neglected. Almost any system well taught, well supervised 
and followed up is to be preferred to a better system that is not! 


SUMMARY 


We have covered generally a broad view of some management needs 
that have application in many fields other than Quality Control. Defining 
the function well and using various management tools properly applies to 
all groups. What I have tabbed the "spirit" of managing is perhaps even 
a way of life if looked at in one sense, There is no question about the 
need for the "other side" of management to stand as the basis for all 
technical activity. 


I have tried to stress that we must be especially mindful of con- 
ducting Quality Control soundly because we are so new to the manufactur- 
ing scene. In order to remain, we mst prove that our presence materi- 
ally contributes to a plant's progress toward optimum efficiency and low 
cost. We mst be "good" to achieve it! 


A good, balanced Quality Control Program is not an accident] It 
just doesn't "happen"! It must be planned} It must be well managedl} 
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STATISTICAL METHODS IN ENGINEERING DESIGN 


Enoch B. Ferrell 
Bell Telephone Laboratories, Inc. 


ABSTRACT 


Variability in a manufactured product will probably always be with 
us. We can live with variability if it is predictable, i.e., if it is 
self-consistent and stable. To achieve stable variability we need a 
proper coordination between the design of the product, the design of the 
process used to produce it, and the components and tools used in the 
process. The simple but effective techniques of statistical quality 
control are powerful aids in arriving at this coordination. 


-%eteoktXteo te koto ta t - € 


When we start to talk about the business of design, we should 
realize that it has several objectives. We should design a product that 
will do what we want it to do after it is made. We should design a 
product that can be made at a reasonable cost. We should design a 
product that will perform its desired function not only when new but 
also after considerable use or life. And we should design a product so 
that if something does go wrong with it we can get at the trouble to fix 
it. This last, which we might call "Design for Maintenance", in some 
cases reduces to the simple ability to remove one unit of an assembly 
and replace it with a new unit. 


If we study these various objectives, we soon discover that repro- 
ducibility underlies all of them. When we say we want to design a 
vacuum tube for a certain use, we really are not talking about "a" 
vacuum tube at all; we are talking about many vacuum tubes to be made in 
the future according to this design and we want all of them capable of 
performing the desired function. That is, we want interchangeability, 
which to the manufacturer means reproducibility. When we talk about 
“life", we mean not the life of a few tubes tested in the laboratory, 
but the life of many tubes which will be tested in service. Here again 
we are interested in the reproducibility of that life. And when we talk 
about maintenance, which very often involves replacement of piece-parts 
or sub-assemblies, we want uniformity in these replacement parts. 


If we accept the thesis that reproducibility is an underlying 
objective of design, we find it very difficult to separate the design of 
the product from the design of the process that is to produce it. 


We are not likely to get very far in the design of a new product 
and its associated process without wanting to turn to the laboratory or 
to the shop for more information. We want to know the strength or 
thermal expansion or some other property of a proposed material. We 
want to know the tolerance capability of some tool. So we plan experi- 
ments and make measurements. And when we start to interpret the result- 
ing experimental data we immediately raise the questions of credibility, 
reproducibility, precision, and the like as applied to the experiment. 
Some of the best tools to couple with our engineering know-how, and to 
help us answer some of these questions are the simple, but effective, 


statistical methods that have been developed by the Quality Control 
Engineer. 
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We sometimes hear that the methods of statistical quality control 
are applicable only after we are in large-scale production. Many of us, 
however, have witnessed the great assistance that these statistical 
techniques can provide in the initial setup and adjustment of our mass 
production processes. It is generally recognized that these techniques 
are of enormous value in the pilot shop. It is not generally recognised 
just how early in the history of a new product the application of sta- 
tistical quality control methods should begin. Let me cite a few 
examples. 


This first example goes back about to the beginning of my contact 
with statistical methods. At the time we were working on a glass-sealed 
switch. Our work resulted in the design of a product and a process for 
making it. This glass-sealed switch is a very simple device. It con- 
sists essentially of two plain permalloy reeds supported at their outer 
ends with their free ends projecting towards each other cantilever 
fashion with a little overlap. See Figure 1. When placed in a magnetic 
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Figure 1 - Glass-Sealed Switch 


field the free ends attract each other, come together, and make contact. 
The supporting structure is a glass tube surrounding the reeds, and 
sealed to the reeds at their fixed ends so as to enclose and protect 
them. One of our early problems was the size of the magnetic gap between 
the free ends of the reeds. How much should the reeds overlap and what 
should be the spacing through which they moved to make contact? We set 
out to determine an optimum experimentally, but found it very difficult 
either to produce or to measure a given set of dimensions. We also found 
our results influenced by the length of the reeds, the type of seal which 
held them and several other things which seemed to have a great deal of 
variability. It appeared that we would have to design the process before 
we could complete the design of the product. 


At about this point we arranged for another department of the 
Laboratories, acting more or less as a sub-contractor, to produce a small 
number of these switches on an experimental basis for our study. Their 
people prepared a set of tools, set up a process and made some switches. 
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After some experimentation on their part, they announced that they were 
ready to make experimental models. We gave them a set of dimensions and 
they made 48 switches. 


We received these 48 switches, measured certain characteristics, 
including the ampere turns required to close the contact, and plotted a 
statistical quality control chart of these measurements. See Figure 2. 
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Figure 2 = Operate Current. A study of 48 switches 
made under supposedly constant conditions, 
using the "Old Tool". 


Each point on the chart is the average operate current for four switches. 
The limits — the dotted lines — are computed by a standardized sta- 
tistical method. The location of the last point outside the limits on 
our control chart indicated that the last 4 switches were fundamentally 
different from the first 44. We went to our sub-contractor and inquired 
about this, but were assured that all 48 switches were just alike. We 
protested that our computations indicated a fundamental difference and 
he asked us to check our computations. 


We did check our computations, found them correct and set about a 
two weeks program of study in an effort to explain the outage. At the 
end of this time we had discovered that whereas the magnetic reeds for 
the first 44 switches had been cleaned by heat treatment in a hydrogen 
furnace, the reeds for the last 4 switches had been cleaned in an acid 
bath. This had a significant effect on the surface condition and hence 
on the stiffness of the reeds. We had detected the presence of an 
assignable physical cause for abnormal statistical variation and had 
actually identified the cause. 


We showed the sub-contractor our findings and the difference was 
readily admitted. Here is a very common pitfall. A change had been 
made in the honest belief that it would have no effect at all. But it 
did have an effect and the simple statistical analysis of actual results 
had shown that it did. 


We were now no longer interested in the last 4 switches. We re- 
plotted our control chart for the first 44 switches and found all points 
now inside the control limits. We sometimes describe this situation by 
the phrase "in control". The indicated tolerance, however, was some- 
thing like +25 ampere turns. As compared to an average of about 50, 
this was entirely too much variation for our purposes. Our 
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sub-contractor pointed out that although the variability was large, he 
thought he knew why. One of the supports in the assembly tool was a 
little weak and could be stiffened. Another part of the assembly tool 
obstructed the vision of the operator; and this could be corrected. 
"This should reduce our variability”. 


Our statistical analysis, however, had given us another piece cf 
information. The 44 switches appeared to be in control. Now when we 
have a state of control the variability is not due to one or two 
important causes, but to a great many unrelated minor causes, and the 
removal of one or two of them will not make any important reduction in 
the variability. This, in fact, is what the words “in control" mean. 

We recommended that he not spend time on these minor modifications in 

the assembly process. We said that in order to significantly reduce the 
variability this whole process must be heaved in the ashcan and a com 
pletely new process with a new set of causes for variation mst be 
designed. Our sub-contractor, however, still thought it would be worth- 
while to try to fix up the existing process. It is common at this point 
to have some discussion as to possible meanings of such phrases as “minor 
modification of the process", "major redesign of the process", and "brand 
new process". These semantic difficulties are seldom resolved at this 
time. 


During the next six months, the sub-contractor made several modifi- 
cations of the assembly tool and produced a considerable number of 
switches. A record of this in the form of a control chart is shown in 
Figure 3. The part at the left is a history of what he did before our 
discussion of the 44 switches. The last 11 points in this section 
represent the 44 switches. We have labeled this section of the chart 
"Old Tool", The next section of the chart which we have labeled "New 
Tool" represents his effort to modify the process so as to reduce the 
variability. 


Briefly, the upper chart is a control chart of averages. On it 
each point represents the average operating current for each successive 
sample of 4 switches. The line through the center is the grand average 
and the dotted lines are control limits. They are computed by well 
known, simple statistical techniques and represent the limits beyond 
which we do not expect a respectable process to go. You will note that 
there are many points outside the limits. Actually, these points repre- 
sent, for the most part, deliberate readjustments by the operator. He 
would make a small lot of switches; he would observe that the average 
operate current for this bunch of switches seemed to be low; and he 
would adjust the tool upwards. More than likely the next bunch of 
switches would possess a high average operate current and the tool 
would be readjusted down again. Hence, on this chart the control limits 
represent the inherent capability of the process and the outages repre- 
sent the human interference. The lower chart shows the variability or 
dispersion. Each point represents the variability or spread within a 
group of four switches. It has no points out of control. This means 
that the process in use had a stable inherent variability. The 
operator's readjustment had little or no effect on this. The process 
was inherently stable and this was its dispersion. The middle section 
of the chart indicates that by mans of the new tool the variability had 
been reduced by some 15%. This is not much of a reduction and actually 
the total amount of data available here is not enough to be sure that 
this improvement is real. 
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During the time that the sub-contractor had been working with the 
new tool, one of our engineers had been groping for a new assembly 
process. He proposed several different processes. Some reason for 
dissatisfaction with each of these was discovered before it was put into 
use. Finally, a new assembly process was conceived and tools for it were 
built. The two points on the chart at the beginning of the section 
marked "New Process" — the two points labeled "Calibration Test" — 
represent the first 8 switches made by this new process. With some 
optimism the indicated low dispersion was accepted as valid. The average 
of the 8 relays was accepted as indicating the actual tool setting, and 
a@ single correction on the tool setting was made. 


Switches represented by the rest of this "New Process" section were 
then made with the results shown. The control was good and the dis- 
persion had been reduced by a sizeable factor. The last little section 
at the end of the chart shows the effect of using a different magnetic 
material for the reeds with no other change in the switch or the process. 
The dispersion appears to be not affected; the average is affected only 
slightly. 


Let me review some of the things that this case history illustrates. 
First, it illustrates the ability to find assignable causes for vari- 
ation — even though, in this case, the assignable cause arose from a 
process change believed to be so unimportant that it was overlooked. 
Second, it illustrates the fact that if a process is essentially in 
control, tinkering with the process, modifying here, polishing it up a 
little there, will have no real effect on the inherent variability. 

When we combine these two, we realize that before you can evaluate the 
design of a product or a process you must bring the process into control. 
If, after it is in control, the process displays an intolerably large 
variability, you might as well heave it out and start over. You will not 
improve the variability by modifying the process. 


We have been talking here about variability and stability in the 
same breath. It may seem that the two are incompatible but this is not 
so. Tne instantaneous value of the voltage delivered by an oscillator 
has great variability — we say the voltage varies sinusoidally — but 
the oscillator may have very great stability. This means that the 
magnitude and frequency of the variation is constant. The concepts of 
variability and stability can be applied in combination also to 
resistance noise. The instantaneous voltage of the resistance noise is 
highly variable, but the average power at a given temperature, bandwidth 
and resistance is quite stable. When the statistician talks about 
statistical stability or when the quality control man talks about a state 
of control, they are talking about a variability which may be either 
large or small, but a variability that is consistent with itself. 


As a third lesson from the example look at the quickness with which 
the capabilities of the new process were established. There was no lost 
motion in demonstrating that the new process was an improvement. We had 
a good measure of the inherent variability of the old process to compare 
it to. 


I might mention that when our sub-contractor saw the new process he 
quickly agreed that we were not stretching the meaning of the word when 
we said the process was new. It bore no similarity at all to the process 
whose variability we had found undesirable earlier. I might also add 
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that since that time — and, I hope, possibly to some extent because of 
that experience — that sub-contractor has become a staunch supporter of 
the statistical quality control idea. 


Here is a second case history. We were working on some similar 
glass-sealed switches which contained little pools of mercury that wetted 
the contacts so that the electrical circuit was closed and opened by 
liquid mercury. At one stage during our work on these switches a lot of 
some 30 or 40 were made and their operating cleracteristics were measured. 
This was finished up late in the afternoon and the switches were put on 
life test to run overnight. I took the data from the measurements home 
with me that night and on my knee during a 35-minute ride in a commuters’ 
train I analyzed these measurements by use of a simple control chart. 

The next morning, almost before I got to my desk, the engineer who had 
made these switches and placed them on life test came to me with the 
statement that one of the switches had failed. It would not even run 
overnight. I asked him which one. He told me. I checked with my 
control chart, and found that it was one of those whose measurements had 
been out of control on the previous afternoon and gave him the identifi- 
cation of four or five other switches which were also out of control and 
might be expected to misbehave. All but one of these other switches 
failed before the day was out. Most of the remaining switches gave lives 
that were measured in weeks. 


This second example illustrates the fact that when a control chart 
says an item is fundamentally different from the bulk of its fellows, 
this difference is worth some attention. 


For my third case history let me move on in time to where we were 
setting up a pilot shop to make these glass-sealed switches at a rate of 
a few thousand a month. This rate of production did not warrant any 
fancy tooling-up job, but did justify making a set of simple, easily 
operated non-automatic tools. The design of the switch was such that a 
large part of it was fabricated by welding together small pieces which 
had been sheared off from wire or strip stock. We made a set of simple 
hand operated shears to cut off these pieces. See Figure 4. We could 
change from one piece-part to another simply by changing the die and 
stop in the shear. The shears looked simple, reliable, and easy to 
operate. As they were set up we sheared off 24 pieces of each type of 
piece-part, kept them in order, measured them, and plotted control 
charts. We also measured a random sample of 4 from each of 4 succeeding 
lots of 80 pieces. We found that all but one of these shearing oper- 
ations indicated a close approach to control and indicated a uniform 
spread of a little less than t 1 mil. The similarity of the variations 
of the different types of piece-parts gave considerable confidence in 
the stability of the shear and its suitability for a variety of jobs. 
Since we had determined its inherent variability by a study of nearly 
500 pieces — 40 from each of a dozen types — we had considerable 
confidence in this figure of t 1 mil. 


The departure of the average, in the case of each type of piece- 
part, from the intended dimension indicated that our set up error was 
also roughly t 1 mil. Accordingly, we marked these piece-parts on our 
design with tolerances of + 2 mils and prepared inspection gauges to 
suit. 


The one piece-part which did not show the close approach to control 
was a piece-part not really suitable for this type of shear. The ability 
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of the control chart to aid in determining the suitability of a tool for 
a job and in determining its inherent variability when used on a variety 
of suitable jobs is one of its great advantages. 


As a matter of expediency, in this case, we went ahead and used the 
shear on the piece-part for which it was not really suitable. But we 
did this with our eyes open and gave special attention to the selection 
and training of the operator who used it. 


This illustrates, I think, how a little bit of experimentation and 
a little bit of statistical analysis can supply us with tolerance figures 
for our design that are realistic and that fit the design and the process 
to the equipment available. 


As a final example, leteme go 2 little more modern. Quite recently 
we were building an experimental switching system such as might be used 
in a telephone office. The circuitry in this system involved some 2,000 
germanium diodes. These were obtained from several different suppliers 
in lots of from 200 to 800. As these lots came in, the diodes were 
measured. We, of course, had nothing to do with the manufacturing 
process and so our measurements had no effect on the manufacturing 
process. The purpose of our measurements was not even primarily an 
acceptance inspection, although in the end we used these measurements to 
do some of this. Our principal purpose in these measurements was to find 
out before we completed the design of our circuitry what variability we 
should expect in these diodes. We found that the distribution of back 
resistance, for example, was a badly skewed distribution which in the 
statistical language could be described as something like a Log-Normal 
distribution. The variability was so great that it did not make sense 
to talk about a "mean plus or minus" something. The limits could be 
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better described as half a megohm times or divided by 9. That is, we 
found diodes running all the way from 55,000 ohms to 4.5 megohms. w#ith 
this knowledge we designed our circuits to work with diodes of 50,000 
ohms back resistance. In two years of almost continuous operation we 
have had practically no trouble with diode circuits. 


This last case bears on the problem of the circuit man or his 
mechanical equivalent, the assembler. He admittedly has little or no 
control over the piece-parts that he is to assemble. This makes it all 
the more important that he should make a realistic evaluation of the 
variability that does exist in his piece-parts and should design his 
assembly so as to accept that variability. 


We have been talking about variability and stability. These are 
not new concepts. The experienced engineer nearly always takes them 
into consideration. He may not employ a formal language in discussing 
them. His own use of them may even be more or less subconscious. But, 
in the end, he usually does a pretty good job of insuring a satisfactory 
variability with reasonable stability. 


Two points, however, we would like to emphasize: First, these 
ideas can be formalized. There is a definite and acceptable language to 
use in discussing them. There are simple computational methods that can 
greatly improve and sharpen up our engineering judgment. Second, we can 
profitably use these ideas and analytical techniques in the earliest 
stages of development, even in research, so that when we arrive at a 
design for production we have a solid knowledge of what variability we 
can expect and a confidence in its stability. 
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APPLICATIONS OF STATISTICAL METHODS IN BUSINESS ADMINISTRATION 


Dr. Grant I. Butterbaugh 
University of Washington 


The subject matter contemplated by the title of thie paper is no- 
thing if not voluminous. It would take a series of sessions such as this 
one in order to make more than a slight introduction to its full dis- 
cussion. Without doubt the applications of statistical methods in busi- 
ness administration are legion. We have ample evidence of that in the 
writings of the past, but we can only speculate on the many unrecorded 
uses of statistical methods by the non-vocal or non-literary practition- 
ers. In this paper, therefore, I shall speak of the beginning applica- 
tions of statistics in business administration, the changing meaning of 
the term "statistical methods”, how scientific development has progress- 
ed, what the situation appears to be at present, and the direction in 
which research is pointed. Furthermore, being in the field of education, 
I feel it would be appropriate to make a few remarks on the basic educa- 
tional problem in business administration. I shall not go into the sub- 
ject of the design of industrial experiments ~- a vast field with a bib- 
liography of thousands of references; for a symposium on this subject 
see reference (1). 


As you know, the word "statistics" is used in different ways in 
everyday language and so the applications of statistical methods cover a 
wide field of endeavor known as business administration. One use of the 
word “statistics” is in reference to a collection or presentation of in- 
formation in the form of mumerical data. Another use is in reference to 
a method or technique for dealing with numerical data. In the two afore- 
mentioned senses Statistics is a very old subject. Statistical data have 
been found in the early writings of the Babylonians and Egyptians of some 
4,000 years ago when business transactions and accounting records were 
recorded on clay tablets and papyrus. Census statistics formed the basis 
for the administration of early states in their collection of taxes, in 
the drafting and provisioning of armies, and in the tmsiness of building 
vast empires. Many an unearthed temple corner-stone or collection of 
clay documents has yielded mte evidence of the importance of inventories 
of men, manitions, and food in the early administration of the church- 
state. One clay tablet in the British Museum originating about 1600 B.C. 
is inscribed with forecasts concerning the chances of a sick man's death 
or recovery; another with the probabilities of the success or defeat of a 
military expedition. These events were evidently of such a repetitive 
nature that it was possible to develop some form of statistical prognos- 
tication regarding them. At any rate, the history of record-keeping is 
the history of accounting and statistics, and all of this record-keeping 
lies in the realm of descriptive statistics. 


While the great preponderance of statistical work of the world has 
fallen, and still falls, into the category of descriptive statistics, the 
heart of modern scientific statistics is based on the theory of proba- 
bility. Here we encounter so-called inductive or inferential statistics, 
wherein generalisations are made from samples of data. Statistical in- 
ference based on samples involves the application of probability con- 
siderations to the sample data. This is what is commonly meant when we 
speak of “statistical methods". 


Statistical methods based on the theory of probability have been 
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developed mainly by statistically-minded mathematicians or mathematically 
minded statisticians during the past two centuries. These methods have 
been put to use in all true sciences and in the many arts by applied 
statisticians and statistical administrators. The application of induc- 
tive statistical methods in business administration is of comparatively 
recent origin compared with the application of descriptive statistical 
methods. Business Administration appears to be one of the last of the 
main areas to accept intensive application of probability statistics or 
scientific sampling. One can only wonder at the possible reasons for 
this phenomenon, since business administration has been in evidence since 
the dawn of history. 


Perhaps business administrators saw at first only the need to col- 
lect and record data as an aid to memory. Later on, the budget was born 
out of a desire to forecast operations and to set up standards of opera- 
tion. The development of budgeting brought on advanced accounting anal- 
ysis, which went from simple comparisons of absolute amounts of change, 
to the use of ratios, percentages, and index mumbers to express relative 
comparisons. Here analysis seems to have stalled. Although many trained 
accountants have worked out deep-cutting analyses of variations in costs, 
revemmes, profits, and financial position elements, the theory of prob- 
ability has not come into general use to explain to management the risks 
they take in making decisions involving such variations. Probability 
has not been a basic study in the education for business. Although 
businessmen take risks in all phases of their business dealings and in 
making business decisions, they ordinarily have not had the opportunity 
to learn how to use statistical methods based on the theory of prob- 
ability. This is largely the fault of education, of which I shall speak 
later. 


Variation is a fundamental characteristic of science, business ad- 
ministration, and of life itself. Variation may be obvious and inescap- 
able in some cases, and readily explained. In other instances it may be 
quite elusive, subtle, and not easy to clarify. We know that the mea- 
surement of any real thing can never be had with exactitude. There can 
be no human eye that can detect infinitesimal separations. Variation 
between sample measurements is therefore inevitable and sampling varia- 
tion or error mst be recognised as real. We recognise two types of 
variation in repeated measurements in business administration, ascribing 
them to two different sets of causes: 


1. Sent ere variation - brought on by human, mechanical, or mat- 
erial imperfections which may be sought out and removed if it is econom- 
ically feasible to do so. The improved measurement may not be worth the 
cost of the effort, but that is a matter of business judgment and de- 
cision. Assignable variation is akin to bias and bias results in in- 
accuracy - missing the target or true universe value. To the extent that 
@ sample result differs from the universe value, to that extent a sample 
result is inaccurate. One may set up a budget of revemes and expenses, 
of cperating results, or of financial position. Against this budget one 
can measure variation in the sample results. Again, standards of work 
output may be set for men, machines, or whole departments or plants 
against which comparison may be had through sample remlts. Accuracy 
can only be measured when a nominal value, a standard, a budget, or some 
other target is arranged. 





2. Random or inherent variation - comes about from the infinite 
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causes of variation in materials, methods, men, and machines. Random 
variation is capable of reduction but not of complete removal. This ran- 
dom variation or error can be measured and controlled but cannot be 
eliminated completely as long as sampling is used as a method for esti- 
mating the universe values. 


As one or both types of these variations can be expected in all 
sampling operations in business administration or anywhere else, it is 
the function of statistical sampling methods to eliminate bias, measure 
variation or sampling error, analyse such variation, and in respect to 
that analysis to provide sound bases for making certain types of statis- 
tical inferences regarding the true measures of the target universe. 
Business administrators should distinguish between the "target" universe 
value sought and the "sampling distribution” or "frame" value sought. 
Since we cannot ordinarily measure accuracy we can only do our best to 
eliminate the mititudinous types of bias in our statistical methods of 
collecting and analysing data. We can, however, measure precision or 
sampling error in the sampled estimate of the value sought. This pre- 
cision measurement gives an indication of the amount by which the esti- 
mate made from the sample deviates from the result that would have been 
obtained through infinite sampling, or complete coverage using the same 
methods, equipment, time span, and personnel. One can then express his 
degree of confidence in the sample statistics as estimates of the frame 
or sampling distribution value. Since business administrators make de- 
cisions based on evidence, it would seem advisable wherever possible to 
have an objective expression of the degree ox confidence in the evidence 
submitted. Scientific sampling methods give answers to some of the 
businessman's quandaries. 


The theory of probability underlies true scientific methods and 
techniques used by management in making decisions. Research has brought 
forth a vast reservoir of ideas for business use. This can be illus- 
trated by the development of non-parametric and distribution-free methods 
of estimation. The idea of ridding oneself of the assumption of normal- 
ity or knowledge of the shape of the basic universe distribution in 
sampling has an appeal. Getting to know the vast array of techniques and 
the usefulness and limitations of these mathematical methods is a broad 
challenge to the capability and perseverence of any statistician. One 
article alone had a reference list of some 1700 articles on non-paramet- 
ric statistics; this is a veritable library of ideas worthy of note. (2) 


Over the past sixty years one management movement after another has 
evolved dealing with the problems of industrial production and business 
administration. In 1911 Frederick W. Taylor published his book "The 
Principles of Scientific Management" (3) bringing forth principles in 
connection with controlled experiments made on metal-cutting techniques 
at Bethlehem Steel. Taylor summarised his concept of scientific manage- 
ment briefly as: 


1) The duty of management is to "gather together all -f the tradi- 
tional knowledge which in the past has been possessed by the workmen and 
then of classifying, tabulating, and reducing this knowledge to rates, 
laws, and formlae which are immensely helpful to the workmen in doing 
their daily work". 


2) Another duty of management is the scientific training of workers. 
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3) Management has the duty of co-operation with labor to make sure 
that the work is done in accordance with the principles of acientific 


management. 


4) It is the duty of management to make an almost equal division of 
work and responsibility between management and workers whereas in the past 
almost all the work and the greater part of the responsibility were 
thrown upon the men. 


Taylor gave birth to the Taylor System or movement which was shared 
by other important pioneers including Henry Gantt, Frank and Lillian Gil- 
breth, Harrington Emerson, Louis D. Brandeis, L.eP.Alford, A. Hamilton 
Church and others. Scientific Management brought forth a new concept of 
attacking management problems with analytical or scientific methods in- 
stead of "rule-of-thumb" methods. Industrial engineers and efficiency 
experts versed in the Taylor System ideas and principles applied scien- 
tific management in industry and administrative areas with the goal of 
increasing human skills as well as mechanical power. 


Although Business Administration appears to be one of the last of 
the main areas of human endeavor to apply statistical methods in the 
scientific sense, it certainly has made up for lost time during the past 
thirty years. The first attack was on problems concerning the control of 
the quality of manufactured products. We all know the modest beginnings 
of statistical quality control for a period of fifteen years after its 
discovery in 1924 by Dr. Walter A. Shewhart. We know how difficult it 
has been to educate business administrators (management), in the value of 
these newer uses of the old statistical principles. Perhaps there has 
never been a more intensive educational program in any field than the one 
started by the invention of the Shewhart control chart. 


The first book in the field was by Dr. Shewhart himself in 1951, (4) 
although it was preceded in 1928 by a book in which T. C. Fry devoted two 
or three pages to control charts. (5) In 1933 the American Society for 
Testing Materials produced the well-known "Mammal on Presentation of 
Data” (6) - a little book of 38 pages which was followed by a reprint 
with a supplement including the "control chart" in 1957. There have been 
mumerous printings of the mammal in the years following its introduction. 


The British Standards Institution brought out a mammal by E.S. Pear- 
gon in 1935. (7) This was the first publication of the Institution on 
the subject of statistical quality control methods. It was followed by a 
revision in 1942 by B. P. Dudding and W. J. Jennett. (8) 


The famous booklet of the American Standards Association known as 
American War Standards (9) (10) served as principal references during the 
intensive quality control training program of the World War II, under the 
direction of the Office of Production Research and Development of the War 
Production Board. These books and mamals together with a popular book 
by Leslie EB. Simon in 1941 (11) were the fundamental literature of 
thousands of persons from industrial establishments and colleges who 
received training in the 8-day courses of that early period of SQC his- 
tory. 


Although statistical quality control and acceptance sampling tech- 


niques for the inspection of incoming materials took off to a slow start 
from 1924 to 1942, the momentum gained during the war period produced a 
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veritable atomic explosion of ideas and techniques for application of 
the theory of probability and sampling to production control once the war 
was over. The literature increased until no man alone could keep up with 
it all. (12) Im many hundreds, even thousands, of periodicals, books, 
brochures, and company mammals, one may come upon explanations of appli- 
cations of some phase of statistical theory or methodology for use in 
industry. This statistical education has spread throughout the world 
until today it is a backward industrial country indeed where men are not 
making use of these scientific sampling methods. (13) 


One of the first papers appearing on the use of statistical quality 
control methods in the field of administrative applications was delivered 
before the Royal Statistical Society of London by W.F.Newland and E.E. 
Neal in 1939. (14) There had been many papers and a few books published 
during the fifteen years after the invention of the control chart but 
whether they were in the field of textiles, ceramics, steel, automobiles, 
electrical or mechanical engineering, or some other area, they centered 
on the absorbing subjects of theory and methods in the economic quality 
control of manufactured product. 


One of the earliest articles devoted to administrative applications 
of statistical methods was by W. Edwards Deming and Leon Goefrey in 
1941. (15) Although it is a natural step from the use of the control 
chart in mass production of mamfactured products in the factory to the 
use of such charts in mass production of the products of strictly human 
operations, someone mst be the first to try an adaptation and show that 
it is feasible. Por those who may not have heard of this first important 
use of the control chart in control of human operations, it may be stated 
that the Bureau of the Census was reported to have saved approximately 
$263,000 through the use of sampling methods and control chart techniques 
in the inspection of card-punching and verifying operations connected 
with the 1940 census. Something like 51 million cards were sampled out 
of over 175 million cards key-punched. Percent defective control charts 
were kept for about 475 clerks who qualified for inspection on a sampling 
basis. The article gives details of how the program of verification of 
clerical work was carried out. 


A few years ago the use of statistical methods in the control of 
manufactured product was unthought of. Today it is just as sensible to 
control the quality of personnel product from the point of hiring, 
through daily work, to the finished product of services rendered. A 
service organisation might well take as its definition of quality con- 
trol: 


"Quality control is a system for co-ordinating the quality main- 
tenance and quality improvement efforts of the various groups in 
the organisation so as to enable production at the most economic 
levels which allow for full customer satisfaction." (16) 


We may hesitate to apply such controls because it is difficult to 
deal with “machines” that can talk back, become balky or sulky, cause 
work stoppage or delay in production, or even walk off the job. Because 
the application is difficult does not justify avoiding it for that is but 
an excuse and not a reason. Man can do practically anything he really 
wants to do. Capability studies have been made in a rough sort of way 
from the hiring of the first hired hand, or rather from the first pur- 
chase of the first slave, tut scientific capability studies are in their 
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infancy. We lack knowledge of proper standards, particularly standards 
of mental work. Though a good deal of progress has been made, we lack 
adequate means of readily testing in advance whether a human being has 
the capability to turn out a required quantity of mental and/or physical 
work to a definite degree of accuracy and consistency required. It 
might be said, we are also short in our ability to set up standards of 
mental work. The adoption of administrative quality control mst wait 
on the installation of a system in an office. 


It was 1942 before the necessity of reducing waste in labor, mater- 
fals, and time of producing quality products gave a real impetus that 
generated the statistical quality control movement. But it was not long 
before the wide potentialities of the usefulness of SQC could be seen to 
extend to business administration or administrative applications as well 
as to the industrial areas where it originated. 


Credit mst be given to W. B. Rice for early articles on the use of 
statistical quality control methods in business administration. In 1943 
he discussed the adaptation of quality control techniques to office and 
financial management problems. (17) He gave a practical example of how 
office overtime might be controlled by the use of control charts. The 
use of control charts in the office as well as in the factory was advo- 
cated. 


Another early article by Rice followed in 1944.(18) The author 
dealt with the use of control charts in relation to cost analysis. He 
also discussed the control of certain expenses with warning of changes in 
the nature of the population samples. These articles by Rice and others 
about this time appear to mark the beginning of a real interest in the 
application of probability methods to business administration problems. 
It was not until 1950, however, that a book devoted exclusively to the 
use of probability statistics by management was written. (19) This was 
26 years after the invention of the control chart. Phis book by R. K. 
Mueller is intended to spur the imagination of management personnel to 
consider using statistical control in their business, if it has not al- 
ready been used, and, if it has been started as statistical quality con- 
trol alone, to extend applications of these methods into more adminis- 
trative phases. This is not a book on statistical techniques but rather 
one which presents an overall survey of the applications of probability 
statistical techniques from a practical management viewpoint. It tells 
how to use a statistical control program in the organisation and gives 
helps for introducing the technique, for selling it to top management, 
supervision, employees, suppliers, and custcmers, and for administering 
the program once it is started. 


It is most interesting to follow the development of a new idea such 
as the use of statistical methods in industry to see how long it takes 
for it to take hold. In this field there was very slow development until 
the war stimlated statistical methodology in industry, but in 1945 it 
seemed as though the dam of secrecy and occupation with war work had 
broken and men became vocal and literary. The flood of articles, papers, 
mamuals, books spread over the world. 


Out of World War II there developed the application of the scientif- 
ic method to the over-all operations in hand. The problems of the de- 
fense of London from air attack and the urgent problems connected with 
gabmarine warfare and bombing brought forth the application of teams of 
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brain workers to attack problems as a whole. This was called "Operations 
Research", or "Operational Research". Operations Research was carried 
over into peacetime as the application of the scientific method to the 
overall operations of a business. Operations Research has a broader 
meaning than plant operation or sales department operation. It looks at 
problems from the viewpoint of economic value - total cost, profit, and 
general effectiveness of a program. Operations Research is not statis- 
tics (history), it is an experimental science, with the goal of active 
control of future operations before it. In the application of Opera- 
tions Research extensive use is made of the mathematics of probability 
and statistical methods of analysis, although higher mathematics is not 
necessarily required to solve Operations Research problems. Sampling, 
analysis of variance and covariance, control charts, significance tests, 
are valuable tools when needed in this work. Sometimes higher forms of 
mathematics are required and we note the use of the Waiting line theory, 
the Search theory, the Game theory, the Communications theory, the Monte 
Carlo method, or some type of mathematical programming such as linear 
programming or dynamic programming. In today's forward rush toward 
scientific business administration techniques there has been a tremendous 
development in the use of mathematics and electronic data processing 
machines. Take linear programming as an example of research in this 
area. Linear programming is a method of optimizing a linear function 
when the variables are subject to a set of linear constraints. Such a 
mathematical method might be useful in the administration of scheduling 
production and shipments, for instance in a transportation problem con- 
cerned with the manufacture of a product in a half-dozen plants and dis- 
tribution to sixty warehouses scattered throughout the country; or the 
problem of finding the best order for traveling salesmen to tour a number 
of cities so as to minimise the total distance covered. According to 
Carl E. Noble (20) = the graphical method works for two independent vari- 
ables but an algebraic method, called the Simplex Method is used for 
solving problems involving from 4 to 20 or more independent variables. 
Thus, it may be necessary to use high speed computers to solve the prob- 
lem. There are countless statistical methods being evolved by research 
workers for eliminating arithmetical computations, some of which are 
discussed by W.R. Vogel. (21) 


Although interest has been growing rapidly in the potentialities of 
mathematical programming techniques in decision making, and in the past 
six years an extensive literature has been building up as indicated in 
an article by G.B. Dantsig, (22) it is to be stated that as yet linear 
programming is not widely used in industry, though there have been a 
number of very successful applications of the method thus far. 


In the newer school of scientific approach to administrative prob- 
lems the business scientist will: 


1) Develop a mathematical model or quantitative hypothesis for the 
activity to be predicted. 


2) Compare the deductions derived by logical manipulations of the 
abstract model against adequate observational data in controlled experi- 
ments to find agreement between abstraction and reality. 


3) Conduct critical tests of the overall theory in which predicted 
substantial effects are observed. 
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The business scientist will make use of results obtained and re- 
ported by earlier writers on similar problems elsewhere. That is one 
reason why searching the literature is an important primary step in 
successful scientific work. It enables one to learn of the successes 
and failures (although failures are not as readily reported as successes) 
of others, the problems faced, conditions under which methods were 
applied, and the techniques and methods used. 


To get an educational toe-hold in this field of study one might read 
"Introduction to Operations Research" by C.W.Churchman et al. (23) and 
meanwhile read as "warm-up" material the papers by W.E.Alberts (24) and 
Harlan D. Mills. (25) 


For a person who wishes to learn something about linear programming 
without running head-on into higher mathematics, I suggest three short 
articles ahead of those mentioned under (20), (21), and (22). These are: 
H.T. Schwan (26), A.W.Tucker (27), and A.C.Rosander (28). These last 
three articles will be found in the March 1956 issue of Industrial 
Quality Control. 


The need for more than the usual high school and college allotments 
of mathematics in order to understand the discoveries in statistical 
methods research in the field of business administration, is nowhere more 
evident than in such articles as "The Inventory Problem” by J. Laderman 
et al., (29) and "The Use of Mathematics in Production and Inventory 
Control” by A. Vassoni. (30) There are many articles in this area, a 
— which are given in references(31), (32), (33), (34), (35), and 

36). 


"It is becoming increasingly apparent," Mr. Vassoni states, "that 
some new scientific and mathematical theories are now being developed in 
this country which, in combination with the remarkable performance of 
electronic data processing machines, will bring about significant changes 
in current managerial techniques." In the course of studying production 
and inventory control, Mr. Vassoni reports it was repeatedly necessary to 
explain his mathematical theory to operating personnel, many of whom have 
had little or no formal training in mathematics at the college level. 

One author, T.M.Whitin, (36) says, "The next few years should bring about 
much additional research which will help decide to what extent industrial 
application of the various types of inventory control systems will be 
profitable. Contrary to the situation in the natural sciences, univer- 
sity research in the field of inventory control is far ahead of current 
practice.” 


Unless I am completely incapable of diagnosing the present situa- 
tion, it will be several years before any but a few colleges of business 
administration will have courses of training in management, accounting, 
marketing, production, or any other major field in which articles such 
as those cited pertaining to the inventory problem will be assigned as 
required reading. The perusal of any college catalogue will disclose 
that students in colleges of business administration are not required to 
have the mathematical background to enable them to cope with the advanced 
and ever advancing literature dealing with such problems. And neither, 
I might hasard an opinion, are present day business administration 
faculties required to have such training. These startling words may be 
news to those not in close touch with the situation, but I believe them 
to be a statement of the truth of the present day college situation. 
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Colleges of Business Administration do not, in general, require any 
mathematics beyond high school algebra and geometry as prerequisite to 
courses. These high school subjects are almost entirely forgotten, and 
skill in them reduced to sero, by the time students enter courses in 
college in which such subjects might be of some use. In this country 
college mathematics has been considered as unnecessary to the education 
of a future businessman as a foreign language. A quarter or semester 
course in elementary statistics is generally required of all business 
administration students, a course which is taken with some fear and 
trembling and much apologising for a lack of mathematical aptitude. 
Students can all read and write after a fashion but for the most part 
are grossly inept, if not downright illiterate mathematically. It will 
not be possible for the brilliant research being done to be taught in 
undergraduate courses in colleges until the entire educational system is 
changed to include mich more training in mathematics in the entire 
twelve-year period leading up to collezs. The fundamental place to start 
this radical change is in the home. Th:re will need to be considerable 
education of parents to encourage the study of mathematics by their 
children, and to insist that teachers all along the line work their 
children harder and longer. At present the teaching of courses dealing 
with mathematics in any form other than simple grocery-store arithmetic 
is as painful as building a house on a foundation too weak to support 
the superstructure. 


I am not arguing that higher research into the use of statistical 
methods in business administration or administrative methods should slow 
down. Far from it, for that would be folly, indeed. I simply want to 
put in a strong word for more mathematics and statistics in the curricula 
of grade schools, high schools, and colleges. Mathematics is the lang- 
uage of mmbers. However, we treat mathematics in this country as though 
it were a language we never expected the businessman to make continous 
use of in the future. We do not learn enough of it over a long enough 
period of time. It may be that people in general have not realized that 
the world of business and industry has changed and now requires greater 
mathematical understanding for real progress. It may be because of the 
overcrowded conditions in schools due to the current philosophy that 
every child is entitled to a high school education, and should be in 
school until 18 years of age, that we find mathematics being reduced in 
importance for the sake of the mentally weak. Because mathematics is 
not entertaining like TV and fun like tossing a basket-ball around, we 
introduce many softer courses of instruction adopting new philosophies 
to rationalize our actions. Thus we may advocate little or no homework 
and permit the students to sidestep courses dealing with data, formulae, 
and computations. Perhaps what we need in this country is an entirely 
new method of teaching arithmetic. Arithmetic might be considered easy 
in time, as easy as English had one been exposed in youth to the method 
taught at the Trachtenberg Institut in Zurich, Switserland. (37) It is 
little wonder that it becomes more and more difficult or impossible for 
the college student and the businessman to read advanced articles in the 
trade and professional journals of the day; they are just not equipped 
to do it. 


A mach more down-to-earth and readily understood article on inven- 
tory control, that is, one that would be more likely to be read and used 
in practice, and find a more receptive audience today than those listed 
in references (29) to (36), is one by W.F.Hoehing. (38) Although this 
article applies the theory of probability as exemplified in the 
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cumlative Poisson distribution, the mathematics are not beyond a second 
or third course in Business Statistics, and thus there is a reasonable 
chance that a few of the current business administration students might 
learn about it, understand it, and eventually use the method. Even more 
readily understood articles pertaining to inventory control and sales 
forecasting are those by Frances B. Newman (39) and Charles A. Bick- 
ing. (40) The latter article shows how the control chart can be used 

to assist management obtain internal control and gives examples covering 
1) merit rating of production personnel, 2) trend in manufacturing cost, 
3) industrial lost-time injuries, and 4) forecast of sales volume. In 
another article (41) Bicking strongly supports the adaptation of statis- 
tical methods to administrative applications and managerial problems. 
Here he brings out the applications pertaining to inventory control in- 
volving twelve products distributed from four warehousing points; an 
application pertaining to the rating of technical personnel; and one in 
the analysis of indirect expense as a percent of sales. Another "easy- 
to-read" article in the same vein is by Guy G. Parkin. (42) These last 
five articles are not likely to discourage the average business magasine 
reader if he has had an introduction to statistical theory and the 
ambition to learn new methods of dealing with administrative problems. 


Mathematical research development in marketing may have had its in- 
spiration in such early papers as the one by W. Edwards Deming in 
1941. (43). Here was discussed the need for directing the collection of 
data for marketing research towards the identification of causes of in- 
creases or decreases in sales. Benefits to be derived through the 
application of Shewhart methods of quality control were discussed with 
features of the plan. The part sampling plays in marketing research was 
presented with an outline of the main aspects of the statistician's job. 
Also, an original outline was given of two principal types of statistical 
problems and the relationship between the evidence, degree of belief in 
the prediction based on the evidence, the action to be taken, and the 
possible consequences of the action. 


An early milepost along the road of application of scientific sam 
pling in marketing research was by Theodore H. Brown, in 1942. (44) 


One comes across an occasional article utilizing scientific sam- 
pling in the field of marketing such as M.E. Brunk and W.T.Federer. (45) 
This article pertaining to research designed to measure consumer wants of 
@ commodity and reporting on some 34 experiments and results thereof 
indicates the direction statistical research is taking in this area. 

It is evident that the universities are not developing research in the 
marketing area as rapidly as in production and inventory control accord- 
ing to M. Anshen. (46) Anshen states "in this day marketing budgets 
involve commitments on too large a scale to permit executives to con- 
time to accept historical patterns in a management world in which 
rational exploration of alternative strategies and planned optimizing are 
becoming standard operating procedures.” These words indicate that per- 
haps we are on the verge of a marketing literature that will include 
rules of general applicability or a method of research that will provide 
answers to such common questions as: 


1) How mech money should we spend on advertising next year? 


2) What is the best division of the total advertising budget among 
the various available media? 
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3) How can we make rational decisions with respect to complementary 
budgets for advertising and other sales activities in local areas? 


Today the rules for making decisions in setting advertising budgets 
come under the heading of: 


1) A fixed percentage of last year's sales (or this year's sales 
forecast, the percentage being determined by past practices or the 
pattern in the industry; 


2) A fixed mmber of dollars per unit sold last year, or forecast 
to be sold this year; 


3) A fixed-percentage rule modified by recent profit performance and 
short-term profit forecasts. 


The reason for this situation is lack of knowledge on how to estab- 
lish direct cause-effect relationships between sales promotion outlays 
and sales to ultimate consumers. The above three questions do not 
appear to be answerable through existing methods. However, with all of 
the writing on the advantages of scientific methods many marketing re- 
search men still go no further in their analyses than the use of rela- 
tive comparisons in percentage, using methods from which it is in- 
possible to measure the error of their sampling results. 


Work measurement is an area in which statistical methods have been 
developing. This field is closely allied with industrial engineering, 
as are statistical quality control, operations research, and cost accoum- 
ing. Work measurement means a complete study and description of an 
operation including selecting appropriate characteristics of the opera- 
tion to be time-measured, improving the methods and procedures where 
possible, setting up a control system for scheduling, and in the case of 
incentive standards, controlling pay whereby countable output can be 
related to time input. In statistical quality control, experimental 
design, and operations research, powerful techniques have been developed 
that can be used in "work measurement". For one who wishes to learn 
more about this area of statistical methods in business administration, 
reference should be had to the articles by C.L.Brisley, (47)(48) and then 
one or two books such as "Work Sampling" by R.M.Barnes (49) or "Work 
Measurement" by A. Abrussi. (50) A more recent article, that brings out 
the usefulness of various mathematical methods such as queueing models, 
linear programming, and mitiple regression, is by J. M. Allderidge.(51) 


Statistical methods reach into the business administration of 
various types of endeavor such as department store operation. An easily 
understood article by C.S.Brinegar (52) brings out many interesting 
aspects readily applicable. Fred Streit (53) outlines the program used 
in establishing and maintaining quality standards in a mail order busi- 
ness. For students of sampling, it is interesting to compare discussion 
of sampling in the public utility field in "Some Theory of Sampling” 
by W. Edwards Deming, (54) and the article "Statistical Methods for 
Appraising Public Utility Property," vy E.T.Magruder. (55) These show 
advances being made in the practical use of scientific sampling design 
in business administration. A utilisation of the actuarial application 
of statistical information for maintenance of airplane engines predicts 
the far-reaching uses of probability statistics. Methods that are 
working daily to the advantage of business are illustrated by 
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Allan M. Hall and W.W.Wilcox. (56) (57) 


We come to two principal areas of use of statistical methods in 
business administration which are developing rapidly and will contime to 
expand until they are common to all business. I refer to the use of 
sampling methods in auditing and the use of statistical methods in 
clerical work of all kinds, including accounting. 


It is fascinating to follow a field of endeavor such as accounting 
from its inception in the dim past when temple records were kept on clay 
tablets with a stylus in cunieform characters dow through the centuries 
to the modern application of scientific methods and high speed electric 
machinery. We learn that double-entry bookkeeping had its origin in the 
llth or 12th century as far as anyone knows, and the first printed word 
on this subject still in existence is in a book by a Franciscan monk in 
1494. (58)(59) But it was not until after the publication of "Auditing" 
by L.R.Dicksee in 1892 that auditing began to take on the modern aspect 
of a profession. (60) What appears to have been the first approach to 
the use of probability sampling in auditing work is the article "The 
Efficacy of Tests," by Leo A. Carman. (61) Carman presented a table 
which seeks the mathematical determination of the relative degree of 
probability of discovering one out of a specific number of fraudulent or 
false items included in a larger group, by the examination of samples of 
various sizes. The underlying idea was that when a single false item is 
disclosed a discovery of sufficient value has been made to justify a 
completely new consideration of the sampling process with the possible 
alternative of a 100 percent check. 


Let us see how this pioneering approach of Carman in 1933 was 
carried forward in succeeding articles ami books. R.H.Prytherch in 1942 
discussing the problem of test checking states, "Judgment mst be used 
to determine the extent to which various parts of the work mst be 
checked and internal control is a factor in these decisions." He out- 
lines the statistical rule as to the reliability of samples and its 
application to test checking reproducing the table presented by Carman 
nine years previously. (62) 


The same Carman table is mentioned again (p. 21) in 1950 by L.L. 
Vance in his book "Scientific Method in Auditing.” {63) This book was 
a pioneering effort to introduce probability sampling in a field which 
has been using judgment sampling methods exclusively from the inception 
of auditing as a profession. Again, Carman's table was mentioned by 
Howard L. Jones in 1952, and in 1955. (64) Once more this table is 
brought up by Vance and Neter (pages 100-101) in 1956. (65) This 
fabulous table bobbed up again in an article by H. Arkin in July 1957, 
wherein Table III is credited to Carman with his idea of using the 
theory of probability in auditing work. (66). Evidently the idea of 
scientific sampling can no longer be ignored in the field of accounting 
and auditing, though it is going to be a long struggle to convince the 
old school of accounting practitioners of the practical efficacy of these 
new techniques. The auditor mst always be prepared to interpret the 
results of sampling and decide upon consequent action. His judgment will 
always be required as an expert in the subject matter to state his prob- 
lem with exactness, decide what is a suitable frame for sampling, what 
the frame is, and how to obtain the information from any unit of the 
frame. This is not the province of the statistician although he might be 
of help in the decisions. To quote from the article by Arkin just 
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mentioned; 


"In general, the accounting profession has not fully realized that 
during the past 50 years a body of knowledge generally referred to 
as sampling theory and practice has been developed on a scientific 
basis. This sampling knowledge has been widely applied in all 
fields. Wot only has it found wide acceptance and application in 
the sciences, tt in industry and business as well. Since auditing 
is looked upon as an art and not a science, auditors have not 
searched out these methods." 


Following up the first publication in 1950, (63) another book of 
larger proportions, namely, "Statistical Sampling for Auditors and 
Accountants" was published by L.L.Vance and John Neter in 1956. (65) 
This is truly a pioneering venture to introduce scientific methods into 
the realm where judgment sampling has always held complete and unob- 
structed sway. (See book review in The Accounting Review by J.L. 

Roth.) (67) Today it is becoming common to find articles dealing with 
statistical procedures and probability sampling in auditing and account- 
ing work. One such article by A.W.Patrick (68) briefly explains the use 
of the control chart idea and advocates the expansion of cost accounting 
to include interpreting the significance of variations. An important 
basic article addressed to accountants advocating the exploration of the 
possibilities of adapting statistical techniques to accounting problems 
is by R.M.Trueblood. (69) 


Two fine papers worth reading by anyone interested in the applica- 
tion of statistical methods in auditing and accounting are those by 
A.C.Smith (70) and Herbert Arkin. (66) Mr. Arkin states reasons for 
failure of statistical sampling method to permeate the field of auditing 
more quickly; they stem from the fact that the auditor's sampling problem 
involves the fact that the auditor does not place sole reliance upon or 
base his final decisions exclusively upon the results of a single test. 
The evidence obtained from each test is evaluated together with the re- 
sults of other tests and other types of examinations and explorations. 
In this respect the auditor's sampling operation differs greatly from 
that encountered in other fields. Sampling techniques mst be designed 
to meet the specific need of the auditor. 


The list of articles elucidating statistical methods in auditing and 
accounting control is growing with every issue of The Journal of Account- 
ancy, The Accounting Review, The Journal of the American Statistical 
Association, Industrial Quality Control, the New York Certified Public 
Accountant, the National Association of Cost Accountants Bulletin, to 
mention a few journals. Rather than list many articles it is suggested 
that the interested researcher should cover the indexes of these six 
journals for the past seven or eight years and also scan the indexes of 
the last half-dozen ASQC Conference Transactions, and ts identify 
articles in this area. By keeping up with the current literature in 
these important publications, one can watch the progressive development 
of the adaptation of statistical methods to auditing and accounting. 


To get a start in the right direction, it is suggested for those 
who are more interested in the application of statistical methods to 
clerical work control and accounting procedures than they are to audit- 
ing, that the following articles be read: W.F.Buhl, (71); Howard L. 
Jones, (72)(73); and A.C.Rosander, (74)(75). In these latter papers 
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Mr. Rosander describes the use of probability statistics in the confirm- 
ation of accounts receivables, the aging of accounts receivables, the 
control of payrolls, the control of inventory, and the control of costs. 
He discusses sampling in the estimation of the condition of physical 
assets, in settling inter-company accounts, and work sampling as a basis 
for cost analysis. 


Regarding cost accounting and cost control there are several impor- 
tant articles readily at hand: Carl E. Nobel (76)(77)(78) and EB. W. 
Gaynor. (79) There are, of course, other articles tut these are good as 
an introduction to the area. 


One could not complete even such a brief recital of the literature 
as this without mention of several additional individuals who appear as 
pioneers in the application of statistical methods in business adminis- 
tration, with particular reference to administrative applications. 

Dale L. Lobsinger, a past-president of the American Society for Quality 
Control, and first chairman of the Administrative Applications Division, 
is well known for his work in quality control for the air lines. An 
article such as "Some Administrative Attributes of Statistical Quality 
Control,” (80) should open the eyes of many persons to the positive bene- 
fits that have accrued to management through the use of these techniques 
in gaging production efficiency and accurately determining facility cap- 
acity under fixed conditions. Some of the areas in which statistical 
quality control techniques are used in the air lines are: accounting 
work, ticket sales, space control, food ordering, aircraft delays, air 
freight billing, aircraft maintenance, inventory control, and interline 
billing. In regard to the last-named type of work, reference should be 
had to W.G.Dalleck. (81)(82) These articles describe how modern scien- 
tific methods have been adopted by the airlines (and railroads too) thus 
cutting clerical costs for the benefit of owners, management, and the 
public. 


And finally, a few papers dealing with statistical control of cler- 
ical operations may give the reader the courage to attempt application 
of these modern methods in his own office. I shall merely list the 
names of the authors and reference may be had to the articles in the 
bibliographys 


J. M. Ballowe (83), B.B.Murdock (84), E.W.Gardiner (85), R.B. 
Shartle (86)(87), Ruth A. Hanna (88), A.L.Hart (89), R.B.Selover (90), 
D.J Blackwell (91), and A.C.Rosander (92). 


It would be possible to list many more articles pertaining to the 
application of statistical methods for the control of clerical opera- 
tions, quality control in the office, for the control of large scale 
invoicing, checking vendors’ invoices, checking tax returns, improvement 
of clerical work quality, and so forth. I apologize to everyone who has 
contributed to this important field and whose paper has been omitted. 
The papers included and the importance of the companies represented by 
the authors ought to convince any present or prospective businessman or 
educator that we have not only come a long way from the ancients in our 
record keeping but have devised wonderful methods for analysing the re- 
sults of our efforts. Business Administration is well on its way in the 
application of scientific methods in decision making and no man can say 
definitely how rapidly it will develop in the near future. It would | 
certainly seem that the American Society for Quality Control can play 
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a very important role in the stimulation of the study of mathematics 
and science not only by the young in all levels of school education tt 
by adults who wish to keep abreast of the developments of this scien- 
tific and electronic age. 
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MACHINE CAPABILITY VERSUS PRODUCT TOLERANCE 


Edward Oakley 
Delco-Remy Divisiorf 
General Motors Corporation 


In recent years more is being said and written about Machine Capa- 
bilities. Just what do we mean by "Machine Capability" or "Capability 
Analysis"? What do we mean by "Process Capability"? And more recent- 
ly a new term has entered the picture, "Machine Reliability." What is 
its meaning? 


It might be well to make the distinction so that we may have a com 
mon understanding of each of the terms. 


Let's analyze the meaning of each of the above in layman's words. 
1. Process Capability 


The accepted meaning of this is that a processing method which 
includes the man, machine and material is being studied to de- 
termine total variability and process stability, usually termed 
"state of statistical control." Invariably, time is an impor 
tant factor to consider because changes in quality level will 
occur as tools wear or are replaced; or as the man makes cor 
rective adjustments; or when materials are changed; or when any 
combination of the above three changes significantly. 


2. Machine Reliability 


This is a new term which has entered the picture in recent 
years. Primarily, it has to do with the long term aspects of 
machine longevity and total expense of maintenance during its 
total life. Due to the records that must be maintained, this 
particular study will require close coordination between Qual- 
ity Control, Maintenance and Production. Properly applied, 
this study can be very valuable in making comparisons among 
different makes of machines doing comparable work. 


3. Machine Capability Study Or Analysis 


The accepted meaningeof this is that a machine or fabricating 
device is being studied under controlled conditions to ascer- 
tain natural or inherent variation. That is, the man is re- 
stricted from making any tool adjustments and the quality of 
the material being processed is known to be of a given quality. 
Tool wear and other influencing factors are also considered. 
Primarily we are concerned with what the machine or device can 
do over a stated period of time and number of pieces of work. 


This paper deals with the preliminary planning necessary in making 
accurate decisions regarding machine capabilities. The intent is to 
show how a machine capability can be made, using basic S.Q.C. princi- 
ples, fortified with sound logic, and applied with common sense. 


Anyone whose full time is devoted to Statistical Quality Control in 
a competitive business finds that decisions must be made quickly and 
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accurately. This is particularly true when studies are being made in a 
machine builder's plant, or in a manufacturing department. The people 
concerned with the problem want the answers "now", and not some time in 
the future. The writer hopes that the experience we have gained in 
machine capability studies at Delco-Remy Division, General Motors Corpo- 
ration, will help others in Statistical Quality Control to be more ef- 
fective in their respective companies. 


A case history of an actual problem is presented for your study. 
The planning steps and problems encountered are listed as a guide to 
anyone concerned with a similar problem. 


Although some discussion will be devoted to the measurement of dis- 
persion, it is felt that anyone attempting a machine capability study 
should know before hand which method of statistical analysis will be 
used, 


Suffice to say, the more common and most widely used methods of 
measuring dispersion are: 


1. Frequency Distribution (Tally) 

2. Calculating Standard Deviation 

3. Probability Paper 

4. X & R Chart 

6. Span Plan - by Leonard A. Seder 
7. Lot Plot - Dorian Shainan 

8. Analysis of Variance 


The author prefers to use 1, 3, 4 and 5 above. 


We might begin by first considering some of the preliminary steps 
necessary prior to making a machine capability study. 


Step 1 


It is understood that the individual assigned to the job should 
have a background in quality control methods and it is highly de- 
sirable to have had experience in manufacturing. In any case, he 
should be familiar with the processing of the part, because this 
will influence his decisions in the capability study. 


Step 2 


(a) Become familiar with the part. Study the dimensional charac- 
teristics involved. After some study it will become apparent 
as to which dimensions are most important to the capability 
study. 

(b) Some dimensions may be checked with a snap gage because they 
are easy to control. Other dimensions must be checked with 
gages where readings of variability can be obtained. Usually, 
these are the dimensions which are used to determine whether 
the machine being studied is capable of processing a desired 
part to a given standard. 

(c) The design of a part should also be scrutinized for any pos- 
sible effect on the study. 

(ad) Consider the kind of material used to make the part. This is 
frequently a factor in tool life, kind of tooling, rate of 
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speeds and feeds, etc., and will also extend into gaging pro- 
cedures and gaging accuracy. 


Step 3 


After becoming familiar with the part, the next step is to check 
the condition of the pieces coming to the machine. The existing 
conditions will have a definite bearing on the capability study. 
Frequently, the roughing operation must be corrected before a ca- 
pability study of the finish operation can be attempted. Care 
should be taken that parts are not specifically selected for the 
study. In all cases, the normal flow of acceptable parts should 
be used for the study. 


Step 4 


Study the process under question, What is the fabricating process? 
(Cut, grind, drill, pierce, hob, shape, form, etc.) Is it a single 
spindle or multiple spindle, single station or multiple station 
machine? How are adjustments made? Automatically or manually? 
How fast must the operation be performed? (Time cycle of process- 
ing time.) 


Step 5 


Verify the accuracy of all gages or instruments to be used in the 
study. Make several checks with these gages to determine their a~ 
bility to repeat a reading. Determine definitely whether the prop- 
er gages are being used. For instance, snap gage versus micrometer, 
micrometer versus shadow graph, two (25 orifice air gage versus 
three (3) orifice air gage, etc. 


Step 6 


Investigate all sources of information. Contact all people who may 
be concerned with the problem. 


1. Engineering 
2. Inspection 
3. Production 
4. Processing 
5. Any others 


After the Quality Control man is thoroughly familiar with the part, 
gages, and process, and has obtained all the available information rel- 
ative to the part, then he must prepare himself to face the first obsta- 
cle in his path. He must be prepared to overcome opposition to a new 
way of determining a machine's capability. Our experience, at first, 
was that the machine builders were not too receptive to having extensive 
studies made in their plants, They felt that all final adjustments made 
after the machine was installed would make the machine capable of meet- 
ing all the prescribed standards. 


We found that a negative attitude toward statistical methods 
stemmed from several preconceived ideas. First, they lacked knowledge 
of Statistical Quality Control. Second, they felt that Quality Control 
only applies to manufacturers of small parts. Third, they failed to 
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grasp the idea that as closer tolerances are demanded of us, we, in 
turn, expect to obtain equipment capable of holding tolerances far more 
exact than the requirements needed years ago. 


As a result of our experience, a few suggestions are being submit- 
ted to help overcome these obstacles. These, we will discuss from the 
standpoint of anticipated purchase of new machines. 


1. 


3. 


4. 
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When entrance is made into the machine builder's plant, find 
out how much is known about Statistical Quality Control. Our 
experience indicates that they have heard about it but feel 
they have no use for it. 


Be prepared to spend some time explaining S.Q.C. Be certain 
that all explanations are confined to simple language that can 
be understood by a layman. 


Be certain the machine operator understands what you are at- 
tempting to do. Be deliberate in explaining Quality Control 
techniques. Further, let him become acquainted with your 
gages. Keep in mind that in the development of the machine, 
they used their own gages which may be different than the ones 
you intend to use. The operator's cooperation is a must. 


Explain to all people involved, directly or indirectly, the 
approach you intend to take and the net results you expect. 


Have the operator set the machine as near the mean dimension 
of the part print as possible. Run machine without further ad- 
justments until a sufficient number of pieces have been run for 
the study. 


Identify and maintain the order in which the pieces were run. 
The maintenance of chronological order may be required for fur- 
ther analysis if and when necessary. 


When the capability analysis is complete, a meeting of all per- 
sons concerned should be called. An honest and open-minded 
discussion should then occur, If the study is satisfactory, 
the machine in question may be accepted or prepared for ship- 
ment. However, if the study reveals unsatisfactory results, be 
prepared to explain the discrepancies or unacceptable portions 
of the study in an understandable and convincing manner. This, 
perhaps, is the most critical point in the entire study. You 
are, in effect, saying the machine is "no good", and the ma- 
chine builder will be ready to defend his position that they 
build high quality equipment. It is probable that his first 
defense will be an attack on the accuracy of the probability 
study. Due to lack of knowledge of S.Q.C. principles, the 
machine builder will then question the reliability of every 
procedure that was used, including the final analysis. The 
gages will be questioned, and also the condition of the parts 
that were machined and used for the study. Usually a sugges- 
tion will be made to use selected parts. Be careful not to 
fall into this trap. Selected parts may show what the machine 
is capable of doing under ideal conditions, but it is important 
to keep in mind that pieces coming to the machine in the plant 
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will not be selected, but will be normally produced pieces with 
variability as great as we can possibly stand from an economi- 
cal point of view. 


8. The foregoing steps were suggestions in making a capability 
study in the machine builder's plant. However, the study is 
not complete until the machine is released to production. The 
Quality Control man will need to work closely with the factory 
representative and plant personnel to see that the machine re- 
peats its qualification test. Here again, care should be exer 
cised to see that everyone concerned understands each step to 
be taken in making a good capability study. 


It might be well to mention here that many additional benefits are 
obtained as a result of making good capability studies. A partial list 
would include: 


1. Increase in number of good pieces produced 
2. Reduced skill requirements 

3. Higher morale 

4. Increased pride in workmanship 

5. Reduced scrap and rework 

6. Reduced cost 


Now, to show you how we went about making a capability study, I 
would like to refer to charts which give a pictorial story of the study. 


Figure 1-A shows the top view of the grinder with the relative po- 
sition of the part (distributor shaft) with the grinding wheel and feed 
wheel. Three (3) dimensions must be controlled -- the cam end, .3112 to 
23115; the bearing, .3732 to .3735; and the body, .4896 to .490. In ad- 
dition, taper, out of round, and finish must also be controlled. 


Figure 1-B shows the profile bar which controls the profile of the 
grinding wheel. 


Figure 2 shows how the first pieces produced measured. These were 
recorded in a tally form which immediately indicated that the dimensions 
were not at the same level, The .3112/.3115 dimension was running too 
high; the .3732/.3735 and .4896/.490 were running too low. It can be 
readily seen that if an adjustment is made to reduce the .3112/.3115 di- 
mension, the other two dimensions will go still farther below the lower 
specification limit. On the other hand, if adjustments are made to in- 
crease the .3732/.3735 and .4896/.490, the .3112/,3115 dimensions will 
go farther above the upper specification limit. In order to line up the 
dimension levels with respect to each other, it was necessary to take the 
following corrective action. 


1. Profile bar required lapping on the machine. 

2. Resting blade had to be realigned. 

3. Wheel grit was changed. 

4. Extra hard diamond for dressing had to be used. 

5. Two (2) passes instead of one were required to dress grinding 
wheel. 

6. Feed wheel required adjustment. 

7. Master cam which governs feed wheel contour had to be reground. 

8. Few other adjustments were required. 
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Following the preceding rework, the following study shows the re- 
sults accomplished. 


See Figure 3. 


The last three charts, Figures 4, 5, and 6, show how the grinder is 
reacting today, two years later. As you can see, the operation is well 
in control. 


In addition to the above capability study, an endurance run was 
made to determine the frequency of wheel dress requirements. Whereas 
it was an established practice to dress the wheel every 80 to 100 
shafts, studies revealed that the wheel could grind approximately 450 
shafts between wheel dressings. 


We at Delco-Remy feel confident that the time required to make ca- 


pability studies on new or reworked equipment is time well spent and 
adds immeasurably to the quality of our products. 
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TRIAL RUNS FOR QUALITY INSURANCE 


Harold J. Gulde 
Minneapolis-Honeywell Regulator Company 


Introduction 


The developmental phase of a new product probably has more to do 
with later reliability of the unit than any other one thing. At Honey- 
well a new unit must pass through a design phase, a product planning 
phase and a trial run phase before it is ready for regular production. 


Designs of new units are originated to satisfy a customer's specific 
needs or as a result of engineering research to find a better control. 
Two trial runs are made during this phase of the development. These 
are the building of engineering models and the testing of models in 
field application. Attention in this paper will be centered on later 
production trial runs rather than on trial runs during the design phase. 


The product planning phase accomplishes the following things: 


1. A preliminary classification of piece part characteristics 
is made, 

2. Processes are assigned or developed, 

3. Tools are ordered, 

h. Gages are ordered, 

S. Cost estimates are made, 
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- Method layouts are written, 
A trial run inspection procedure is written. 


Piece Part Trial Runs 


During the piece part trial run it is determined that piece part 
tools and processes are capable of meeting print tolerance. These trial 
runs may involve a machine capability study on the more critical dimen- 
sions. The critical dimensions were spelled out in the classification 
of part characteristics during the product planning phase. The number 
of parts checked from the trial run will vary from a very few on Punch 
Press operations to many on critical machining operations where the 
tool may wear rapidly. 


The final piece part inspection procedure is completed at the time 
of the trial run of the piece part; it is based on the preliminary 
classification and on the trial run. The procedure is a list of charac- 
teristics to be checked, an AQL for each and the gages for each. The 
AQL keys the sampling plan to be used. 


The First Trial Run of Units 
Trial runs may be made to establish any one of the following: 


1. That parts, all of which are to print, will assemble into 
units which meet specifications, 

2. That parts, from production runs of all piece parts, will 
assemble with normal assembly practice into units which meet 
specification, 

3. That future runs of production parts will assemble with normal 
assembly p~actice into units which will meet specification. 
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It must be decided which basis will be used for the trial run. 
Usually number three above is preferred although occasionally during 
this first trial run of the unit, sorted parts may be used per number 
one above. 


The first trial run of the units is under the supervision of 
Process Engineering. It is here that the ability to build the unit is 
really tested. The problem is, will the unit when produced under pro- 
duction conditions give reliable results, both immediately when received 
in the field and also over a predetermined life. The process capabili- 
ties are measured, gages are proved out, and the testing for repeatabil- 
ity is completed at this time. 


A fairly large group of units is processed (about 50-1000) under 
the watchful eyes of Honeywell engineers. Any irregularity is traced 
down before the unit leaves this stage. A preliminary inspection proce- 
dure on the unit is completed at this time. This procedure lists each 
characteristic and classifies it as Critical, Major, or Minor. This 
procedure will form a basis of the final inspection procedure in later 
runs of the unit. 


Units from this run are also put on life and shipping tests. Life 
tests are accelerated tests which show how well the unit will stand up 
under normal and adverse conditions. 


An example of the work done during the trial run is the repeata- 
bility - fixture - process variability study. Fifty units are numbered 
and read through on a fixture by an operator. On repeated readings of 
the fifty units, all previous readings are not available. Table I 
shows the results recorded from such a study. Table II shows the infor- 
mation to be obtained from means. Table III shows the variations 
measured by the example. 


Table I 
Repeatability - Fixture - Process Variability Study 
ae Fixture 1 Fixture 2 
Number Reading 1 Reading 2 Reading 3 Reading Reading 5 Reading 6 
1 9.5 10.1 9.6 10.1 12.7 13.0 
2 9.2 9.5 10.4 11.2 12.5 32.7 
3 Tel 8.2 8.6 9.1 10.1 10.3 
4 8.6 9.1 9.1 8.6 9.9 10.0 
5 10.3 12.3 11.5 11.4 12.5 12.0 
6 Tee 7h 8.5 9.5 10.6 10.2 
7 10.9 12.1 12.8 11.0 11.8 11.5 
8 9.0 10.5 10.2 10.0 11.9 11.0 
9 9.2 9.0 9.3 10.8 10.7 11.3 
10 8.1 8.6 8.5 10.7 10.9 10.8 
x 8.9 9.7 9.9 10.2 11.4 11.3 


Table II 
Mean Results from 
Repeatability - Fixture - Process Variability Study 


Information Obtained from Means Means to Use 
1. Over-travel in the first reading. 1. Reading 1 vs. Reading 2. 
2. Drift in the unit. 2. Reading 2 vs. Reading 3. 
Reading 5 vs. Reading 6. 
3. Comparison of fixture mean reading. 3. Readings 1, 2, 3 vs. 
Readings , 5, 6. 
4. The process average. lh. Reading 2 or Reading 5. 
Table III 


Variation Results from 
Repeatability - Fixture - Process Variability Study 


Information Obtained from Variation Variation to Use 
1. Repeatability of Fixture l. 1. Paired differences Reading 
2 and Reading 3. 
2. Repeatability of Fixture 2. 2. Paired differences Reading 
5 and Reading 6. 
3. Process variation. 3. Variation within the 50 unit 


readings of Reading 2 or the 
50 unit readings of Reading 
5. 
h. Severity limit necessary between 4. Paired differences between 
Operating and Inspection. Reading 3 and Reading 5. 


Production Acceptance Run - The Final Trial Run 


The next hurdle for the unit is the Production Acceptance run. In 
this phase the unit is built by production people under production 
supervision. All assembly tools and gages must be performing satisfac- 
torily at this time. This run is usually fifty units, but on important 
developments it may be as large as 1000 units. 


The process variation on each assembly characteristic is checked 
again. If the distribution is normal it must be comfortably inside of 
specification. It is not enough for the six o spread to be just inside 
of specification. This is particularly true with a characteristic that 
is non-adjustable such as wattage. In such case, a new set of parts 
can easily shift the distribution. An adjustable characteristic however, 
such as calibration can usually have the six o limits nearer specifica- 
tions. Also, this type of characteristic is easier to salvage once a 
rejection has occurred. 


The inspection procedure on the unit is completed at this time; 
each characteristic classification is approved. The sampling plans are 
finalized and the inspection stations are set up. 


A sample from the run is sent to Engineering. Floor results are 
correlated with results on Engineering Test fixtures. There are alsc 
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many additional design tests run at this time over and above those run 
on the floor. 


pass 


Summary 


A unit developed by Honeywell and in production must successfully 
the following trial runs: 


1. Engineering models, 

2. Test of Engineering models, 

3. Field models tested in actual use, 
. Trial run of piece part, 

S. Trial run of wit, 

6. Life Test, 

7. Shipping Test, 

8. Production Acceptance run, 

9. Engineering tests of final unit. 








DISTRIBUTION REQUIREMENT SPECIFICATION 


B. J. Kinsburg 
Bell Telephone Laboratories 


1. Introduction 





The distribution requirement specification which is presented in 
this paper was worked out to meet a specific need of the L3 Coaxial 
Telephone System. The L3 System is a long distance carrier system de- 
signed to transmit either 1,860 telephone channels, or 600 telephone 
channels plus one television channel. The band width is approximately 
8 mc. The amplifiers are spaced along the route at 4 mile intervals. 
In a 4,000-mile route there will be about 1,200 amplifiers in tandem 
including those at the terminals. 


These repeaters or amplifiers supply a total of about 54,000 db of 
gain to make up for 46,000 db loss in the coaxial cable and 8,000 db 
loss in the associated equalizers at intermediate offices and terminals. 
The required match between the total amplifier gain and the cable and 
equalizer loss must be held to better than 1 db for a satisfactory 
system performance. The practical realization of such performance was 
made possible only through the application of the distribution require- 
ments on the components of the system; and in particular through a close 
control of process averages. 


To place the distribution requirements in a proper perspective two 
important steps in the design of the L3 System should be mentioned. 


Deviations Studies 





The -ircuit designs were analyzed to determine the sensitivity of 
the circuit to variations in the component elements. In case of the L3 
System the main characteristic of interest was the deviation of ampli- 
fier gain from its central or design value. The need for a quality 
control scheme to limit excursions of the long term process average from 
the central value was established early in the design stage and a Cam- 
mittee was appointed to work out a solution. The work of the Committee 
eventually resulted in the distribution requirement specification. 


Selection of Elements - Equivalent Limits 





Let us assume that each element in a circuit is allotted an equal 
share in its contribution to the variability pattern of the system. 
Using this assumption and the results of the deviation study it is 
possible to calculate the required tolerances on each component element. 
In actual practice what was desired is the most economical overall 
solution to the problem of distribution control. The tolerances so ob- 
tained were therefore modified to permit some elements a greater and 
some a lesser share in contributing to overall distribution depending on 
the inherent variability of each manufactured component element. These 
tolerance requirements on component elements were arrived at in consul- 
tation with the element designers. If available types of elements were 
not capable of providing the desired performance, new types of camn- 
ponents had to be developed. Of interest to the system designer were 
not only manufacturing tolerances, but also items such as reliability 
and variability with age and temperature. Thus arriving at a satis- 








factory solution required a large amount cf give and take between the 
circuit and the component elements designers. 


2. The Basic Decisions 





The committee which worked out the distribution requirement speci- 
fication consisted of Bell Laboratories and Western Electric Engineers 
with Mr. H. F. Dodge as chairman.* 


At the outset certain decisions were made which broadly defined the 
content of the specification. These decisions were: 


(1) To limit, with a fair degree of reliability, the systematic 
variations in the process average of the specified characteristic of a 
product from a design center or a nominal value (N) within a band de- 
fined by +1/3 o where o is determined by the basic capability of the 
process. 


Thus: PA tim = N+ 1/30 


(2) To accept all product within + 1 Go from the nominal value. 
Thus the limit on the individual values ch will be called the A limit 
becomes: 


A limit =N% 3RO=NTA 


Incidentally, this second decision leads to a simple numeric for the 
limit on the process average since now we can express 


PA tim=N't 20 = Nf 0.14 


The above two decisions are illustrated by Fig. 1. As shown on 
Fig. 1 our discussion will be confined to the case of characteristics 
having substantially normal distributions and having both minimum and 
maximum specified limits. In the overall plan, however, provisions have 
been made for characteristics having skew distributions or having a 
single specified limit, maximum or minimum. Fig. 1 shows two limiting 
acceptable distributions, normal in shape, both having a standard devi- 
ation o" equal to the standard deviation of the process, one having an 
average, X" located 1/3 o" below nominal and the other having an average 
1/3 o” above nominal. The allowance of * 1/30 is, of course, arbitrary 
and represents an estimate of relative importance of the systematic to 
random changes in the process average. 


(3) The third decision made by the Committee was broader in charac- 
ter than the two preceding and, therefore, more difficult to define. 
Briefly stated it was to provide sufficient flexibility so that the 
producer would not be unnecessarily limited in his choice of the manu- 
facturing process. Also it is necessary to furnish the producer with a 
method of operation while the process is out of control. 


* The original description of the distribution requirements appeared in 
Bell System Technical Journal, Vol. XXXII, No. 4, July 1953, pp. 943- 
967. "The L3 Coaxial System - Quality Control Requirements" by 
H.F. Dodge, B.J. Kinsburg and M.K. Kruger. Figs. 1,2,3,4,5,6,9 and 10 
of that paper are reprinted by permission of BSTJ from the above 
reference. 
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In order to implement the distribution requirements it is necessary 
to provide procedures which define: a) The character and quantity of 
evidence needed regarding the collective quality of a product as it is 
made day by day. Of necessity such procedures are of the nature of in- 
spection practices. b) The criteria for judging when such product may 
be conforming to the intent of the specification, and c) The treatment 
or disposition of product units when these criteria are not met. 


To implement the above decision, several procedures or methods have 
been prepared to meet the different conditions that may be encountered 
in production. We will discuss, with particular emphasis on the intent, 
four of these methods: 


a. control chart method 
b. batch method 

c. three-cell method 

d. records method 


(4) In some cases it would be desirable from the design point of 
view to apply distribution requirements to more than one characteristic 
of the product. For instance, it is desirable to control the process 
average of the gain-frequency characteristic of the amplifier at several 
frequencies. Another example is in the case of combined elements in 
which a coil is wound with resistance wire so that the resulting element 
has both resistance and inductance. 


It is easy to see that the administration of distribution require- 
ments in such cases may be quite cumbersome when one of the character- 
istics is out of control. The decision made in this case was to select 
one characteristic for application of distribution requirements and 
apply only go-no-go limits coupled with the records method to the re- 
maining characteristics. 


3. Control Chart Method 
Outline of the Method 


This method was designed to apply to an essentially continuous 
production process in which the sequence of individual units is not 
lost. To quote the specification it "is intended for application where 
production comprises a reasonably steady succession of individual units 
or small group of units fron®a common source; so that individual units, 
as produced, may be kept in the order of their production and control 
techniques applied to test results". 


The control charts specified are an adaptation of the Shewhart 
control charts for sample averages, X, and sample ranges, R, for "control 
with respect to a given standard" (see Part 3 of the ASTM Manual on 
Quality Control January 1951). Two modifications in the control chart 
for X have been introduced: a) A central band rather than a central line 
has been provided, and b) a run requirement has been imposed on seven 
successive sample averages. The sample size has been standardized at 5. 


The limiting distributions for sample averages are illustrated on 
Fig. 2. Referring to Fig. 2 the desired process average limits (PA 
limits) are the averages of the two acceptable distributions of averages 
of 5 units. 
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PA limits = N= .1A 


The limits for individual averages of samples as limited are derived as 
follows: 


( on) 
A5 limits = N t (1/3 o" + 3 —) 
( %) 


=N * (.1A + .4A) 
=N = .5A 


The limit for ranges (R5) is the customary upper 3-sigma control 
limit for ranges to be met by the range sample of 5 


R5 limit = 4.918 0" 
= 4.918 _ 
4.91 10 A 


= 1.48A max. 


At the outset, and whenever eligibility to use the control chart 
method is lost, a 100 per cent inspection rate is required. To establish 
eligibility for use of the control chart method, at least 7 consecutive 
samples must satisfy the following criterion. 


Criterion I - Establishing Eligibility 
a) The averages X, all fall within the A5 limits. 
b) The ranges, R, all fall within R5 limits. 


c) Seven consecutive averages, X, are not all outside the 
same PA limit (not all above the upper PA limit or all 
below the lower PA limit). 


Eligibility for use of the control chart method is maintained so 
long as each current sample satisfies Criterion II. 


Criterion II - Maintaining Eligibility 
a) The average, X, either 


(1) falls within the A5 limits; or 

(2) falls outside the A5 limits, but at the same time 
all of the 6 preceding consecutive averages fall 
within for A5 limits; 


b) The range, R, either 


(1) falls within the R5 limit; or 

(2) falls outside the R5 limit, but at the same time 
all of the 6 preceding consecutive ranges fall 
within the R5 limits; 


c) Seven consecutive averages, X, (for the current sample 


and the 6 preceding samples)do not all fall outside 
the same PA limit. 
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dad) No major change is made in raw material, machine set-up 
or personnel, which, in the judgement of the functionally 
responsible personnel, may have a significant effect upon 
the quality of the product. 


Loss of Eligibility 





When the current sample fails to satisfy Criterion II the three- 
cell method applies to the product represented by the current sample, 
and to subsequent production until eligibility is re-established by 
satisfying the conditions of Criterion I. 


Operating Characteristics of the Control Chart Method 





The operating characteristics of the control chart method are 
shown on Fig. 3. In computing these curves the statistical models used 
to evaluate the expected performance were limited to normal distri- 
butions. The ordinate is the probability of accepting the current _ 
sample and the abscissa is the displacement of the process average, X, 
from the nominal value, N. 


Fig. 3a shows the OC curves for Criterions I and II when the 
standard deviation of the process o' = 0.3A. The Criterion I is seen, 
as would be expected, to be more stringent than Criterion II. 


Fig. 3b shows what would happen if the non-parametric requirements 
on the 7 consecutive points were omitted. These curves are labelled 
"I, II, less C". It is seen that without this requirement Criterion I 
and particularly Criterion II would be very ineffective in controlling 
the departures of the process average from the desired value (N). 


*Fig. 3c shows the operating characteristics for Criterion I when 
o' # 0.3A. This would happen if the original specified limits were im- 
properly set or it the process dispersion changes. Similar curves for 
Criterion II are plotted on Fig. 3d. As would be expected, Criterion 
II is not as effective in constraining departures of the process average 
from N. 


4. Batch Method 





Outline of the Method 





This method is intended for application where production consists 
of intermittent batches of 50 or more units all of which have been made 
under the same essential conditions with respect to materials, parts, 
workmanship and processing. The assumption is that the time sequence 
of production of individual units even if it could be traced is not 
significant and the only order which has a meaning is the sequence of 
successive batches. It is also assumed, however, that there is a basic 
continuity in raw materials, parts and workmanship, which is insured by 
appropriate quality control method. 


In this m thod a sample of 50 units is selected at random from each 
batch. The average, X, and the standard deviation, o, of the sample are 
computed and the acceptability of the batch is determined by Criterion 
III. 
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Criterion III - Batch Acceptability 
a) The average X meets the A50 limits where 


A50 limits = N + (1/3 o” + 30°//50) 
= N 2 0.23A 
b) The standard deviation, o, either 


(1) falls within S5O limit where S50 limit 
= .41A, max, or 


(2) fails to meet the S50 limit, but at the same time all of 
the 6 preceding consecutive standard deviations meet the 
S50 limit. 


The basis for A50 limits is shown on Fig. 4. 


Operating Characteristics of the Batch Method 


The operating characteristics for the batch method when o' = 0.3A 
is shown on Fig. ie. For comparison purposes this figure also shows the 
corresponding curve for Criterion II of the control chart method. 

Figure 6b shows operating characteristics for several values of o' rela- 
tive to A. As would be expected, the batch method, due primarily to the 
use of a relatively large sample, is somewhat more efficient in con- 
trolling the process average than the control chart method. 


5. Three-Cell Method 





Description 


Even the best behaved process is capable of going out of control 
and staying out of control for a considerable period until the assign- 
able cause or causes are found and corrected. If in the interim the 
shipments of the finished products stop, the effect on the interlocking 
schedules may be disastrous. Also what can the harassed manufacturing 
organization do with the uncontrolled product already on the shelves? 


The solution of this problem is the three-cell method and is due to 
the genius of H. F. Dodce. It provides a procedure for continuity of 
manufacture when the product is out of control. This is a procedure 
which permits shipment of some of the product and at the same time in- 
sures the consumer that the shipped product will have a satisfactory 
distribution. 


The three-cell method may be used at the discretion of the manu- 
facturing organization in place of either the control chart or batch 
method. The use of this method is of course mandatory when the product 
fails to meet the criteria of the other methods. 


The basis for the three-cell method is shown on Fig. 5. Each unit 
of the product is tested and conforming units are classified in one of 
three cells: lower, middle or upper. The product is shipped in groups 
of 5 units, each group consisting either of 1 unit from the lower cell, 
3 units from the middle cell and 1 unit from the upper cell or all five 
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units from the middle cell. In this way each group of five units forms 
a small distribution of its own. 


In addition to sorting the conforming units in three cells and re- 
quiring shipment only of a selected distribution this method also re- 
quires maintenance of control charts on average (X) and range (R) using 
random samples of 5 units selected in the order of production prior to 
classification. In this way a record is kept of the process showing 
when corrective action should be taken. 


Operating Characteristics of the 3-Cell Method 





The corrective effect of the 3-Cell method on the process average 
when c' = 0.3A is shown on Fig. 7. In this figure the abscissa is the 
deviation of the process average (x') from the nominal (N) and the 
ordinate the deviation of the average of the packaged product. The solid 
line shows the average long term corrective effect produced by packaging. 
The other two curves show the corrective action if either only 0-5-0 
packaging (dashed line) or only 1-3-1 packaging (dot-dash line) were 
employed. The efficiency of the method is seen to be a factor of ap- 
proximately 3. This efficiency is obtained at a price as seen on Fig. 9. 
In this figure the abscissa is, as in the previous figure, the deviation 
of the process average. The left hand scale applying to the upper 
(solid) line is per cent of product packaged. The right hand scale ap- 
plying to the lower (dotted) line is the average of the packaged product. 
For instance, when the process is centered 0.2A away from the nominal 
value, only 70% of the product can be packaged. At the same point the 
average of the shipped product is about .07A away from the nominal, thus 
confirming the 3 to 1 efficiency factor. Of course, if the process ex- 
hibits a shift in the opposite direction it may be possible to ship the 
unpackaged units. This incidently furnishes an incentive for corrective 
action. 


One of the pitfalls of the 3-Cell method is seen on Fig. 8, where 
the effect of distributions in which o' < 0.3A is shown. This may occur 
if the original limits are too lenient or if a change in the process 
takes place resulting in a tighter distribution. On Fig. 8 the abscissa 
is the deviation of the process average from the nominal and the ordinate 
the deviation of the packaged product. For instance when A = 80 the 
3-Cell method performs essentially no corrective action. This is an 
additional reason why maintenance of conventional control charts to 
supplement the sorting operation is required. 


6. Records Method 


The records method primarily involves maintenance of control charts 
of the product. This method is intended for application when it is not 
practicable to apply the other methods which were described above but at 
the same time some control is desirable. The records method is usually 
specified when there are several characteristics of the product in which 
it would be advantageous from the system point of view to minimize de- 
viations of the process average from the nominal value. In such cases 
the most important of these characteristics is selected for application 
of other methods and the records method specified for the remaining 
characteristics. An example are the combined elements in which a coil 
is wound with a resistance wire on a ceramic spool. The resulting unit 
has to meet both inductance and resistance limits. In this case the in- 
ductance was selected for application of the control chart method (or 
the three cell method if preferred by the manufacturer) and the resist- 
ance was administered under the records method. 
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In effect this is a "pretty please" method in which the manu- 
facturer is asked to control the product but the action taken to 
achieve control is left to his initiative. At the same time it has two 
important advantages; 


1) The customer and the producer have the evidence of control 
charts. 


2) The producer knows from the manufacturing specification 
that control charts are required and therefore includes 
them in his layout of the process and in the pricing 
structure. 


7. Accuracy of Measurements 





Any framework of distribution requirements would be incomplete with- 
out taking into account the accuracy of measurements. It is convenient 
to separate the accuracy of measurements into two major components: 
precision and bias. By definition precision contains all random con- 
tributions and may be assumed (unless contrary evidence exists) to be 
normal in character. The bias contains all systematic components. 


The effect of precision is illustrated on Fig. 10. As seen from 
this figure the effect of the normally distributed error of measure- 
ments is to produce an apparent distribution of the measured values 
which is greater than the distribution of the true values. Let us 
designate the measured distribution by q and the distribution of true 
values by r and the distribution of measuring error by s. 


The relationship between q, r, and s may be readily calculated from 


q =,/re + s¢. The calculated rations of q/r for given rations of s/r 
are listed in Table I below. 


Table I 


Effect of Precision 


Ratio s/r Ratio q/r 


1 1.414 
5 1.118 
22 1.020 
a 1.005 


In the distribution requirements specification the overall instru- 
ment accuracy not poorer than 1f6A was specified with allowance of 1/10A 
for precision. This in effect allows 1/15A for bias. 


In the application of the distribution requirements to specific 
components the battle of accuracies had to be won first. As a rule the 
process capability studies had to be made and in many cases new measuring 
equipment had to be developed before adequate precision was achieved. 
Improvement of measuring equipment or techniques in addition to achiev- 
ing required degree of precision also tends to reduce bias to tolerable 
values. In some cases, however, comparison standards had to be intro- 
duced. Use of comparison standards is an art in itself and is outside 
the scope of this paper. 
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8. Application 





The application of the distribution requirements to the L3 System 
was a step by step process. These steps are outlined in brief below in 
order to furnish some insight as to how distribution requirements fit 
into development of a complex transmission system. 


Component Elements 





The use of the distribution requirements was specified for all 
critical component elements in the L3 amplifier. Some of these were 
specially designed for use in these amplifiers because the: available 
elements did not meet the system objectives based on early deviation 
studies. Each instance of realization of these special designs is an 
interesting quality control story in itself. Some of the quality control 
aspects of this work in connection with the vacuum tubes were published 
in AIEE transactions.* An early summary of application of distribution 
requirements to manufacture of other component elements is also avail- 
able.** As in all development work most of the effort remains untold 
though the results have been incorporated into the general pattern of 
the Bell System. 


Field Trial 


In this stage a field installation was equipped with amplifiers 
made in the factory on a preliminary production line utilizing components 
which met distribution requirements. The resulting system was subjected 
to comprehensive tests. Any significant deviations from design center 
values were examined to ascertain assignable causes. All cases of 
failure were analyzed, and necessary remedial action was taken. 


Production 


Initially the distribution requirements were accepted by the Western 
Electric Company for the component elements only. From the very begin- 
ning, however, the overall amplifier gain characteristics were watched 
for conformance with the distribution requirements. The control charts 
were being plotted and corrective action when required was initiated by 
production engineers. After about two years of production experience 
the distribution requirements were incorporated into the manufacturing 
specifications. The control chart method was applied to the amplifier 
gain at the top of the transmission band, and the records method was 
applied at two other frequencies, one in the middle and one at the 
bottom. A typical control chart of the overall amplifier gain at 8.32 
megacycles is shown on Fig. 11. The excellent control displayed in Fig. 
11 is not an accident but the result of long and painstaking effort. 





* Application of Statistical Techniques to Electron Tubes for Use on a 
4000 Mile Transmission System - by W. Van Haste AIEE Transactions 
Part I No. 23, pp 50-54, March 1956. 


** The L3 Coaxial System - Application of Quality Control Requirements 
in the Manufacture of Components - by R. F. Garrett, T. L. Tuffnell 
and R. A. Waddell. 


The Bell System Technical Journal, July 1953, pp. 969-1005. 
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OPTIMIZING A LIFE TESTING PROGRAM 


Leonard G. Johnson 
General Motors Corporation 


I. Introduction 


In recent years the science of operatios research has been applied 
to a variety of economic situations. As a rule, in economic problems 
we are interested in either minimizing a cost or maximizing a profit. 
The operations research method of arriving at the maxima or minima of 
an econany is to set up a mathematical model of the system, and then 
proceed to analyze the mathematical equations for maxima and minima. 
Amongst the mltitude of industrial problems lending themselves to this 
type of approach, we can find the very interesting and important ques- 
tion of how best to design a fatigue testing program for a given type 
of specimen, be it an automotive chassis part, a ball bearing, a rail- 
road rail, or even a clock spring. Many other life testing programs 
fall into the same general category, including the testing of light 
bulbs, endurance tests on rubber tires, and corrosion and wear life 
tests. The design of a fatigue testing program involves two basic 
economic considerations, namely, time and money. Actually, the value 
of the time element can be expressed in dollars and cents, and, there- 
fore, we should not consider time to be independent of money. However, 
since it is usually desirable to know how mich time a testing program 
will consume, regardless of cost, it is advisable to analyse time and 
cost separately. 


ide Assumptions 


1. The fatigue life distribution of the item under consideratim 
is a Weibull distribution. 


2. The desired coefficient of variation in the mean life has been 
specified. 


3. A fixed number of specimens are run simltaneously (with re- 
placements as failures occur) until the test is completed. 
(The fixed number involved here shall be denoted by n). 


lk. The manufacturing cost per specimen is know. (Denote this 
cost by Cs). 


S. The waiting cost per hour is known. (Denote this cost by C,). 
III. Questions to be Answered 
1. How many specimens mist be failed in order to give the desired 
coefficient of variation in the mean life? (The answer to 
this question will be a value know as r). 


2. What is the median waiting time for a given value of (2)? 
(Denote this answer by als )- 


3. What is the total cost of the testing program for a given (4) 
and (=) ? 
s 
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b. For a given (<=) , what value of (=) will minimize the 
total cost? s 


IV. Properties of the Weibull Distribution 





Before we can answer any of the questions listed in III, we mst 
derive certain mathematical properties of the Weibull distribution which 
will be used as tools in obtaining the required answers. 

Let X = the fatigue life of one specimen (in hours) 


Then, according to Weibull [1], the probability that any specimen 
will fail within X hours is F (X), where F (X) is given by the formla 


F(x) = 1- Exp | -(3)'| Gaal (2) 


(e is the base of the natural system of logarithms) 

(b and @ are known as the parameters of the Weibull 
distribution) 

(b is called the Weibull slope) 

(@ is called the characteristic life) 


By transposition we can write (1) as follows: 


1-F(x) = Exp [-(3)°| (2) 


By inverting both sides of (2), we obtain 


er - : ex |(3)'| (3) 


Taking the natural logarithm of both sides of (3) gives 


b 
— a 
loge == ( Z) (4) 
Taking the natura) logarithm of both sides of () gives 


4 


to — om 
fe tog 1 - F(x) 


= blog.x -b log. f (5) 
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Weibull probability paper is constructed by using (5) as a basis. 
Along the abscissa scale we list the value of log x, i.e., the abscissa 
is a logarithmic scale. Along the ordinate scale we list the value of 
log logy++- . We call this ordinate scale a Weibull scale of the 
probability F. Hence, with x plotted om a logarithmic scale, and with 
F plotted on a Weibull scale, we obtain a straight line of slope b. 
This explains why the parameter b is know at the Weibull slope. 


Three important properties of any probability distribution are: 
1. Mean 
2. Median 
3. Standard deviation 


These can he expressed in terms of the parameters b and © as 
follows: 
MEAN=M-= §-[(1 + 44) (6) 
A 
MEDIAN= L, = 9- (log .2) (7) 


STANDARD DEVIATION = T = Vr (1+2/) -P'( 4s) (8) 





The life for which the probability of failure is q we denote by 
Lg- It is obtained by solving (2) for x when F (x) is taken equal to 


q. Thus, 
y 
Ls O(log. : ms ) (9) 


The standard deviation for a mean as estimated from a sample of r 
failures has been show [3] to be equal to the population standard 
deviation divided by the square root of r. Thus, 








Try =F = EYre- eM? F 


— - 4 \ aa 


M P* (4+ 4/4) ** -_ 











The ame « = k is know as the coefficient of variatim of 


It is a measure of the accuracy of the estimation of the mean life 
from a sample of r failures. 


According to assumption 3 in IJ, n specimens are running sim1- 
taneously. Consequently, any failure that occurs represents the first 
failure in a collection of n items. The median life of the first fail- 
ure in n new items from a common Weibull population is obtained by 
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raising (2) to the power n, and solving for x when fi -F(x)]” =%- 


Thus, MEDIAN LIFE OF FIRST FAILURE IN n “ayX.s 
= 9. (20:8) a2) 
je n 


The Answer to Question Number 1 





The number of specimens which mist be failed in order that a given 
coefficient of variation in the mean life can be realized is obtained 
by solving (11) for r. 


Mr G+2h) _ 
r*(1+1/s) 
K 2 





Thus, r= (13) 


The analytical relatio (13) is plotted in Chart 1 for Weibull 
slopes b = 1, 1.5, and 2. 


The Answer to Question Number 2 





The median waiting time required in order to fail r specimens when 
n are kept running simltaneously (with replacements of failures) is 
the median of the sum of r time periods, each period being the elapsed 
time between two successive failures. 


Equation (1) tells us that x” is exponentially distributed with a 
mean value g?. 


Let x? =y 
b 
Then F (y) = 1 - Exe (-y/9*) (uy) 
Since y is expomentially distributed, it follows that aging does 
not affect the distribution of future incremental values of y (See refer- 
ence [)). 


Let yz the value of x? up to the lst failure, 


Y> = increment in x from the lst to the 2nd failure, 


743° increment in x? fram the 2nd to the 3rd failure, 


y, = increment in x? from the (r-1)th to the rth failure. 


The median value of each ys 
power of (12), i.e., 


(a) Ms = g (=#:2) (15) 
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(i=1,2, ..., ©) is given by the bth 


Hence, the sum of the median values of y,, Yoo +++» Vp is r times 
(15), i.e., 


S$ = ro (22823) (16) 


n 


The quantity S has dimension (TrME), In order to reduce its 
dimension to that of TIME we take the bth root of (16). Tis, 


1 1 % 
[Sum oF (MEDIAN INcREMENTS)®) "*. 5°. r'* ae : *) (17) 


In order to reduce (17) to the MEDIAN OF THE SUM OF THE TIME INCRE- 
MENTS BETWEEN SUCCESSIVE FAILURES, we multiply by an appropriate factor 
G . Hence, the median waiting time becomes 


al, = ar ¢ (toned i)” (28) 


An empirical expression for ¢ which has proven to be satisfactory 
in the author's experience is given by 


4/ 
g « [p= G12) "rl r (i+ t/s) (a9) 


r-in2 r (r+ 1/p) 


It should be pointed out that the illustrations in this report are 
valid regardless of the true values of ¢ , since the charts 1 through 
3 are independent of # , and charts | through 7 are constructed by 
assuming the product $¢ Le. = 00, without specifying any fixed num- 
bers for or L ¢. 





For n = r the median waiting time (18) is 
1/,, 2 2 % 1, 


Hence, the median waiting time ratio is 


> 1 


The relation (21) is plotted in charts 2 and 3 for Weibull slopes 
b=1, 1.5, and 2. 


The Answer to Question Number 3 





Let Z = median total cost of the fatigue testing program. We break 
up the total cost into two components. These are: 
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1. Manufacturing cost of the specimens 
2. Waiting cost (including operating expenses, cost of 
delay in putting a better product on the market, etc.). 
The number of specimens which mst be manufactured isn+r-l, 
since immediately before the final failure there are n specimens run- 
ning and (r-1) have previously been failed, making a total of n + (r-1) 
which mist be available for the test. Therefore, TOTAL MANUFACTURING 
COST = (n+ r - 1)C,. (22) 
(C, = manufacturing cost per specimen) 


( The median waiting time required to fail r specimens is given by 
1): 


Let Cy denote the waiting cost per hour, which we assume is known. 


Therefore, MEDIAN TOTAL WAITING COST = Caf ay Ts] 
102 "he 1%, -V%& 
= prg(teqc= orl. cn (23) 


° Z = (ntr-1)0, + rei co ni’ (2h) 


For (=) = 1, this total cost becomes Z, = @r-1)¢ +r*L cp 


Hence, the median cost ratio is: 











n: $= oer Ngt Fete 5 _ @+GH)+ er ‘isle (F) - 

od hf 
eer Oe 

Let (2) = ¥ Then 

o - Cw a 
a ¥ +(*) eae “tlt ag (26) 
2-4 reel, (S) 
Let i = kK 





2-¢+¢r" L (=) 


me K [rete eortu(ejes] © 
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The relation (27) is plotted in chart for b = 1, and in chart 6 
for b=1.5. It is assumed that r = 20 and L ¢ = 00 hours. 
S 
The Answer to Question Number 4 





To find the minimum value of Ql in (27), we differentiate with 
respect to F and then put “8 = 0. 
d 
Thus, 


in — +1 Ca “tly 4) 
@. lt-¢r L() § |=° (28) 


Solving (28) for the optimum value of F we obtain 
b 


7 lis (=) bet 
io, 35 =) (29) 





Ay 





Substituting (29) into (27) we obtain for the minimum cost ratio 


i- < + (1+) Soy. 


2-Lr+r ee 


Opt. 


(30) 





a vas 


Relations (29) and (30) are plotted in chart 5 for b = 1, and in 
chart 7 for b = 1.5. It is assumed that r = 20 and L . = 400 hours in 
both charts. . $s 


Conclusions 


From charts 5 and 7 we conclude that whenever the waiting cost per 
hour is several times the manufacturing cost of a single test specimen, 
it is most economical to simltaneously run several times as many speci- 
mens as we are going to fail. In case a fatigue testing laboratory does 
not have sufficient capacity to accommodate large values of (2) > it 
would even pay to expand the test facilities to a point where large 

(=) values can be handled. In the long run, the drastic reduction in 
total testing cost would soon pay for the cost of additional expansion. 
Furthermore, with large values of (%) fatigue tests are completed in a 
fraction of the time it takes to complete them with small values of (3). 
As a rule of thumb we can say that a large (= ratio requires a large 
(2) ratio. On the other hand, if (@) is very small, it can be seen 
that there is no justification for a large (#) . In short, the more 
expensive it is to wait, the more test facilities we need in order to 
operate at peak efficiency. This is nothing more than sound business 
judgment. 
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THE USE OF QUALITY CONTROL TECHNIQUES IN OPERATIONS RESEARCH 


R. W. Lindsay, Operations Research Supervisor 
American Enka Corporation 


This report covers an Operations Research project in which it was 
found desirable to lean quite heavily on techniques commonly associated 
with Quality Control. The project was a study of production and inven- 
tory control. 


Work on this project was undertaken in the following sequence: 
1. Formulate the problem and define its limits. 
2. Develop a tentative model. 


3. Test the model to see whether it is reasonable and 
make modifications where necessary. 


4. Develop the information sources and practical 
measures necessary for actually using the model. 


5. Determine whether use of the model accomplishes the 
purpose for which it was developed. This was done 
first by simulation and then by actual trial use. 


6. Develop a procedure for ensuring that the model 
remains consistent with actual conditions in the future. 


I believe you will see a lot of similarity between the above steps 
and those necessary to carry out a successful quality control project. 


Before going into detail, I would like to give you a brief picture 
of the background of the project. 


The rayon industry lived in an almost continuous sellers' market 
from its infancy in the Twenties to the post-war years of the late For- 
ties. During this period, production and inventory control was rel- 
atively easy. As the industry came of age and moved to a buyers' market 
this control area became a headache. Our major problem took the form of 
frequent production program changes. The processes in a rayon plant are 


such that excessive program changes are very expensive in terms of cost, 
quality, and morale. 


We were very interested in Operations Research, and since produc- 
tion and inventory control seemed to be a fertile field for this disci- 
pline, we decided to make the problem our first major project in Opera- 
tions Research. 


First we had to formulate the problem, define its limits and de- 
velop a trial model. Im the initial phase, we seem to have departed 
from the path generally followed in this type of study. In most publish- 
ed studies on production and inventory control, the problem is defined 
primarily as that of obtaining an optimum balance between customer short- 
ages, inventory and program changes. In our case, we were fairly well 
satisfied with our past performance in so far as inventory and customer 
shortages were concerned. It was also realized that the difficulty of 
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developing a realistic model of the comparative cost associated with 
these variables would greatly complicate the study. If we could reduce 
program changes without increasing either inventory or customer short- 
ages, we would be certain that we were getting a real profit increase 
even without such cost figures. We, therefore, decided to make this our 
goal. 


Examination of historical data on shipments showed large variations 
for individual products between time periods. The patterns were such as 
to lead us to suspect that a large part of the variation was of a random 
nature. However, it was obvious that significant changes in the true 
average market demand were superimposed on the random variations from 
time to time. We decided to try to design a probability model as an 
analysis tool to be used in differentiating between these two types of 
variation. 


Since shipments are discrete occurrences in time, theortical con- 
siderations pointed to a Poisson distribution as the most likely one. 
However, some change of variable was necessary because the Poisson dis- 
tribution deals with the number of discrete occurrences; whereas we 
were interested in the poundage involved in shipments, not in the number 
of shipments, as such. 


The problem is similar to that of analyzing the number of cars in- 
volved in highway accidents. The number of accidents occurring may fit 
the simple Poisson, but the variation in the number of cars involved per 
accident makes a compound distribution necessary. Our problem was more 
difficult because the possible variations in poundage per shipment are 
almost infinite. 


We tried fitting a Poisson curve to our shipment data, using an 
average weight per shipment. This attempt was a complete failure. In- 
vestigation showed that the small percentage of large shipments, by their 
occurrence or non-occurrence in a period of time, caused the actual data 
to vary much more than our model. However, the general shape of our his- 
tograms still suggested Poisson. 


We then designed a shipment function for each product line which 
would give increased effect to the larger shipments. The function was 
calculated by dividing the summation of the squares of the individual 
shipment weights by the summation of the individual weights. 


Such a function can be justified theoretically. It is generally 
accepted that the sum of several variables, with the same basic distribu- 
tion, will result in a distribution with a variance equal to the sum of 
the variances of the individual distributions. The variance of the dis- 
tributions based on our shipment function meet this criteria. 


A further check on the validity of the basic model was made by con- 
paring the resulting shipment distributions, for a representative group 
of products, with the corresponding Poisson distributions by means of a 
chi-square test. A good fit was obtained in most instances. In the in- 
stances where the fit was poor, it was obvious that the actual data was 
not all from a common distribution. In other words, a change in demand 
level had occurred for the product concerned during the time period cov- 
ered. Figure 1 shows a typical histogram comparing the actual and theo- 
retical distributions. 
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We also found that the calculated shipment function varied with the 
average demand level. Luckily, this was found to be a straight line 
function, and we were able to establish workable relationships for vari- 
ous product groups from existing data. There was some interesting devel- 
opment work involved. When semples are drawn from a two-dimensional 
distribution, a scatter diagram of elliptical shape is formed, assuming 
normal distributions. There are three regression lines which may be 
calculated from the data. (See Figure 2.) We have the regression of y 
on x, when y is the dependent variable and x is the known variable. 

This line is useful in estimating y from a knowledge of x. We also 
have the regression of x on y, when y is the known variable. In ad- 
dition, we have a third line which represents the structural relation- 
ship between the variables. Such a line is sometimes called a weighted 
regression line. One situation where this line is needed occurs when 
it is not practical to estimate either variable separately but the two 
must be estimated jointly. When shipments for a product go out of lin- 
its, changing the demand level means simultaneously changing the ship- 
ment function. Therefore, with both variables unknown, we are interest- 
ed in the structural relationship. 


Using these variables in combination with a Poisson distribution, 
we were able to construct realistic models of shipment activities. From 
these models, minimum and maximum shipments were determined throughout 
the volume ranges of our product lines. Figure 3 shows a chart for a 
typical product line. The center line represents the true average de- 
mand. The other two lines represent the limits of the expected random 
fluctuations about this average, for a predetermined level of confidence. 
It was necessary to construct such charts originally in terms of the 
shipment functions and then convert the scales to pounds. It will be 
noticed that the variation, as a percentage of the average, is much larg- 
er for small values of the average than for large ones. This is typical 
of a Poisson distribution. The limits are used as control limits to 
determine when a significant change in the demand level takes place. 
The limits also serve as a measure of the inventory required to cover 
normal shipment variations from inventory without a program change. 


I mentioned earlier that our objective was to reduce program changes 
without increasing customer shortages or inventory. We found that lin- 
its based on a five per cent level of confidence came very close to per- 
petuating the overall levels of inventory carried in the past. 


At this point in our work, the development and testing of the basic 
model was complete. The next job was to develop the practical measures 
for actually using the model. 


A major problem was encountered in determining the time period over 
which to analyze shipments. On a weekly or bi-weekly basis, only very 
large changes could be detected because of the large random variations. 
If a longer time period was used, the changed level could seriously 
affect inventory before it had been in effect through even one complete 
time period. 


We finally decided to organize data weekly and to compare the data 
for the latest periods of one, two, four, eight, and sixteen weeks with 
the proper limits. In this way the larger changes were discovered quick- 
ly by means of limits for the shorter periods. Small changes or those 
occurring gradually over a period of time were discovered by means of the 
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limits for the longer time periods. 


We also had another unique problem. It was not sufficient merely 
to determine that the demand level had changed for a product, it was 
also necessary to obtain a useful estimate of the new demand level and 
this, to be effective, had to be done on the basis of ome sample from 
the new population. 


We adopted the expedient of shifting the demand level just enough 
to make the revised limits include the out-of-limit value, as illustra- 
ted in Figure 3. In many cases this underestimates the magnitude of the 
change initially but eventually values close to the true limits of the 
new population will indicate accurately the true demand level. In the 
meantime, we would be in little danger of trouble with no extreme values 
occurring. 


I have talked mainly about shipments so far. However, we found it 
very beneficial to apply this technique to orders received as well as 
shipments. Shipments and orders behave very much as independent vari- 
ables with the same average. Orders are received some time before ship- 
ment is made, as a rule. The weight function is also different for 
orders, due to the tendency for customers to specify more than one ship- 
ment on an order. We found that a separate analysis of shipments and 
orders almost doubled our chances of detecting real changes in demand 
level over the chances using shipments alone. 


Using control limits in the manner described made it possible for 
us to determine the limits within which shipments would probably fall in 
a future planning period assuming no specific knowledge about that peri- 
od. However, these limits were rather broad, and we usually have a 
valuable piece of information about future shipments; namely, unfilled 
orders on hand. It seemed logical to make use of this information to 
narrow the limits. 


Since unfilled orders represent a partial knowledge of future ship- 
ments, taking them into consideration represents a problem in condition- 
al probabilities. 


let me draw a parallel: 


You wish to know the minimum and maximum number of heads you are 
likely to obtain by tossing 20 coins. Based on a five per cent level of 
confidence, you would expect between 6 and 14. However, let's assume 
you are tossing the coins one at a time. You have already tossed 10 of 
them and know this partial result. The minimum and maximm number to be 
expected from this particular trial is therefore the number obtained 
from the 10 already tossed, plus the minimum and maximum number to be 
expected from the remaining 10 coins, which is between 2 and 7. Thus 
the spread from minimum to maximum for this particular trial is only 6 
when the result of half the trial is known as compared to 9 without this 
information. 


We applied this principle in the case of the unfilled orders. The 
average order lead time was determined for each type of product. This 
lead time was then subtracted from the planning period and the maximm 
shipments determined for the remaining period. We call this figure the 
conditional component. The unfilled orders plus the conditional com- 


106 


ponent then gives us the maximum shipments for the planning period. 


We were now ready to test the practical usefulness of the model. 
The completed Production and Inventory Control Plan was applied “on 
paper" to a year's data. Paper programs were determined and resulting 
paper inventories were calculated using actual order and shipment data. 
The results as to average inventories, machine changes, and customer 
shortages were then compared with actual results for the same period of 
time using methods of control then in effect. The control plan reduced 
both machine changes and customer shortages by about fifty per cent 
while overall average inventories during the year were different by less 
than two per cent. 


Production programs had been determined in the past by the Sales 
Division on the basis of outstanding orders and current inventories con- 
ditioned by information obtained from customers as to their future buy- 
ing intentions. 


The Production and Inventory Control Plan was instituted for a trial 
period of six months, with the Sales Division retaining a veto power on 
the basis of their market information. The six months' experience was 
satisfactory. The Control Plan was then adopted and was turned over to 
the Production Planning Department, to be used on the same basis as dur- 
ing the trial period. Experience thus far is fairly well in line with 
predictions made as a result of the original application “on paper.” 


Plans were made to audit the control system periodically in order to 
keep the parameters of the model up to date. We are now engaged in the 
first of these audits. In the course of the audit, we are also re-exan- 
ining the whole structure for possible improvements. For imstance, our 
original model is based on an overall inventory in line with past expe- 
rience. We are now attempting to build into the model an optimum inven- 
tory concept which would minimize the total program change costs and 
inventory carrying costs. 


I have covered only the broad outlines of our development work. 
Many problems of a lesser nature were encountered both in developing the 
basic shipment and order models and in designing means for using theee 
models in preparing a detailed production program. Solutions to many of 
these problems were derived through simulation. 


107 








FREQUENCY OF OCCURRENCE - PERCENT 


30 


20 











_— 
iti 


—_ 





PO!IS 



































0 5 10 15 20 25 30 35 40 
WEEKLY SHIPMENTS (000 LB.) 


FIGURE | - ACTUAL AND THEORECTICAL DISTRIBUTION OF SHIPMENTS 
FOR A MEAN OF 10,000 LBS. PER WEEK, AND A SHIPMENT FUNCTION 
OF 5,000 LB. 


108 


Y - SHIPMENT FUNCTION 


CONTOUR LINES OF CONSTANT 
PROBABILITY DENSITY 





WEIGHTED REGRESSION 








X - AVERAGE DEMAND LEVEL 
FIGURE 2 - ALTERNATIVE REGRESSION LINES 


109 





CONTROL LIMITS 
(000 LB. SHIPPED INDIVIDUAL TIME PERIOD) 








60 + 


50 + 


40 - 


30 - 


20 - 


OUT OF LIMIT VALUE 


. 


FORMER 
DEMAND LEVEL 











~+—— UPPER LIMIT 


“4+—— AVERAGE 


“4—— REVISED DEMAND LEVEL 





! 
| 
P 
+ | 
10 ! +——~ LOWER LIMIT 
| 
0 T 1 T T T T Py T T t os 4 
0 10 20 30 40 50 60 


TRUE AVERAGE DEMAND 


FIGURE 3 - TYPICAL RELATION BETWEEN AVERAGE DEMAND AND 


CONTROL LIMITS 


(000 LB. | TIME PERIOD) 


110 








COMPENSATING FOR SYSTEMATIC EFFECTS DETECTED 
BY AN ANALYSIS OF COVARIANCE IN A MICROBIOLOGICAL ASSAY 


Norman F. Knowlden 
Lederle Laboratories Division, American Cyanamid Company 


The Situation: 


The basic methods for measuring the antibiotic content of a given 
material are microbiological. In this type of method, the inhibition 
which the material causes on the growth of a particular microorganism is 
compared to the inhibition displayed by different levels of potency of 
a standard antibiotic. One of these microbiological assay methods is 
known as a "turbidimetric" method. In this assay method, the amount of 
growth is determined by measuring the amount of turbidity which results 
from the concentration of microorganisms present. A filter photometer 
is used to measure the light blocking effect of this turbidity. A 
photocell picks up the light passing thru the sample and a galvanometer 
shows the intensity of this light. Thus the higher the galvanometer 
reading the greater the antibiotic potency and a typical growth-response 
curve results, (Slide 1). 


The general method (Slide 2) is to dilute both unknown and known 
samples to a level of potency where the change in growth rate is most 
sensitive to a change in antibiotic potency. This level is specific for 
every combination of antibiotic and microorganism resulting in inhibition. 
The diluted samples are placed in duplicate tubes. Into each tube is 
then placed an inoculum containing a medium in which the organism can 
flourish and a heavy concentration of rapidly growing organisms all being 
at about 37°C. The assay tubes are next placed in a 37°C. water bath 
incubator for 3 hours. Following this, the whole "test" is "killed" by 
the addition of a dilute formaldehyde solution to every tube. 


In the method studied, the standard antibiotic and the unknown sam- 
ples, based upon the predicted potency, were diluted to a potency that 
would give the desired amount of antibiotic in the assay tube. Then 
five subsequent dilutions were made of this and placed in assay tubes, 
in duplicate. Thus each sample was represented by 10 tubes in the assay. 
An automatic pipetting machine was then used to add the inoculum. This 
was added to the tubes in each row in order and to each rack in order so 
as to maintain an order throughout the whole test. The assay was then 
incubated 3 hours after which it was "killed" by the addition of formal- 
dehyde solution to every tube following the same order as that used for 
inoculation. Turbidity was then determined in a Lumetron Colorimeter. 
The galvanometer readings of duplicate assay tubes were averaged and the 
results "read" from standard curves prepared from standards run in the 
same way at the same time. 


The Problem and Its Analysis 





Now the problem as presented to me was routine. The laboratory had 
been running check samples, i.e. samples of the standard compared against 
itself, for the past two months. These potency results were presented 
to me and I was asked to calculate confidence limits for the assay. This 
is a standard type of calculation described in many texts. 


Upon examining the data, it was obvious without testing that there 
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was a drastic difference between the two months. Being samples of the 
standard one would expect the average potency found to be 100%. This 
was not the case. The first month's samples averaged 92% and ranged 
from 80% to 95% whereas the second month assay average was 99% and 
ranged from 95% to 103%. Why should these two months differ so? What 
was the cause of this bias? This was the real problem, not the calcu- 
lation of confidence limits. 


This situation was discussed with the laboratory supervisor and 
two routine suggestions were considered as possible answers for the 
difference between months. First we have the age-old possibility that 
the technicians had become aware they were assaying samples of the 
standard and were using "extra care” and secondly we have the possibility 
that sufficient experience with the assay had brought about this improve- 
ment. 


Neither of these possibilities seemed to fit the circumstances 
since both would affect precision rather than bias. Consequently I 
asked to see a routine assay being run. As stated before the standard 
and unknown samples were prepared to the appropriate levels and placed 
in duplicate tubes. These tubes were then placed in a rack in the 
manner shown. (Slide 3). The duplicates were always adjacent to each 
other with the tubes of any one sample common to one column in a rack. 
Each rack contained 4 complete samples, and 20 racks was the usual size 
of an assay. The routine procedure was to assign to the first rack 
tubes containing blanks which would be used to adjust the instrument. 
Following this would be the racks containing standard solutions for 
preparing a daily standard curve and following these would be the racks 
containing the unknown samples, each sample being completely contained 
in one single column of one single rack. 


The tubes were then inoculated (see arrows) proceeding from rack 1 
to rack 20. The racks were then placed in the incubator using a stand- 
ard arrangement (see diagram). After incubation the "kill" stage was 
carried out in the same order as inoculation and then the sample turbidi- 
ty read in order as the racks became ready after "kill." 


This systematic procedure proved to be very efficient as far as 
handling the tubes was concerned but like all systematic procedures it 
lent itself to introducing bias in the assay. As a matter of practice 
always examine how your data was collected. Many times explanations 
of a problem will be apparent and only in this way can appropriate ex- 
perimental designs be developed. 


To substantiate my opinion that this systematic procedure might be 
the cause of our trouble, I requested that sume experimental work be 
done. This was agreed to provided it could be arranged so that it did 
not interfere with the normal rate of output of assay results. Even 
though it was acknowledged that a bias may be present, it was recog- 
nized that this bias appeared to be such that it provided extra assur- 
ance that the true potency results were at least as potent as reported 
if not more so. The urgency of results, even if biased, was necessary 
to allow other development studies to continue. In production work 
this is quite often the case. Rarely can the most appropriate experi- 
mental design be employed without interfering with the production 
schedule. 
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My first request was that in the next assay, duplicate tubes of 
each of two concentrations of the standard, be included in each rack 
used, and the routine procedure be followed. Where these were placed 
in each rack was of little consequence as I was first interested in 
showing that rack differences existed. This experiment was agreed to 
as it was felt that not many samples would be deleted. The data was 
collected and analyzed as in Table I. Graphically the galvanometer 
readings were plotted as in slide 4. The circles represent individual 
readings while the astericks are the average of each pair of duplicates. 
It is evident here, as in the analysis of variance, that there is a 
highly significant difference between racks. Since the tubes of each 
sample was contained within one rack whereas the tubes of the standard 
curve was always in another rack, these rack differences evidently were 
the cause of our bias. Low galvanometer readings for samples would 
result in low potencies. 


Looking at the plot of results there appears to be a nice downward 
linear trend with the order of processing. Consequently the processing 
order was considered as another variable and an analysis of covariance 
performed on the same data as shown in Table I. What this does is to 
remove the effect of the processing order from the measurement of rack 
differences or graphically we now consider rack differences as whether 
there is any significant difference in the amount of deviation from this 
processing order regression between the racks. The racks were now 
found not to be significantly different. We now had our answer, the 
processing order could introduce bias. But how could this explain the 
bias found in the original data. Investigating the records kept over 
the time the original data was collected it was found that during the 
first month the check samples were always assigned to the last rack 
used whereas during the second month they were always assigned to the 
rack immediately following the standard. This was our answer. Also it 
was realized that in all past assays there existed an increasing nega- 
tive bias from the first to the last rack processed each day. 


This is all right to know but how do we go about eliminating this 
processing order effect so that future potency calculations will not be 
affected? A correction term for each rack could be developed each day 
but the calculations would be impractical for routine work. Randomi- 
zation was out of the question. Imagine randomizing 800 tubes. The 
extra time involved and means of identification of tubes would be 
tremendous. The possibility of confusion which would lead to complete 
loss of a days work was too great. Also unless several replicate tubes 
were used, randomizetion would only transfer the rack to rack variation 
to variation between duplicates and our resulting precision would te 


poor. 


Considering the routine that was being followed, it was reasoned 
that this systematic ordering could produce an effect in several places. 


1. During inoculation there was a time difference of 10 minutes 
between when the first rack was inoculated and when the last 
rack was inoculated. 


2. The position in the incubator could be important if the circu- 
lation were not uniform. 


3. The order of kill could be important since there again was a 
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10 minute lag between the first and last rack killed. 


4. The order of reading could be important if there were a drift 
in the photometer with time. 


5. Some combination of these might be causing the trouble. 


Discussing these possibilities with the personnel involved, it was 
decided that the incubator position was the most likely possibility. 
The 10 minute lag during inoculation and kill was not thought to be in- 
portant when you consider the tubes are in the incubator for 3 hours. 


Since the results of the first experiment were so definite, further 
experimenting was agreed to provided it would not interfere with pro- 
duction. We considered it best to investigate one factor at a time 
rather than design an overall experiment. It would take more time of 
course and would be troublesome if interacting factors were the cause 
of our trouble but we felt there would be less chance of confusion. 
Also the technicians could understand what we were trying to do and 
should we find a solution to our problem there would be little diffi- 
culty in installing any changes necessary. This is of prime importance 
in production studies. Unless an improvement, when found, can be 
implemented the study will be just a waste of time. 


Considering incubator position, a second experiment was run main- 
taining the usual routine except for placing the racks in the incubator. 
This was done randomly, using random numbers. Analyzing the results of 
this experiment, similar results as before were found (Slide 4). Pro- 
cessing order independent of incubator order was still significant. An 
analysis of covariance as in Table II verified this. Also an analysis 
of variance on the row and column positions in the incubator showed the 
incubator positions as not being important. 


Next the order of inoculation was considered. This was done by 
randomizing the order of inoculation of racks while maintaining the 
systematic order in the remainder of the assay. It might be well to 
mention here that at no time will randomization within racks be consi- 
dered for reasons stressed before. We felt that the systematic bias 
between racks was of greatest importance and would worry about within 
racks later. The data on inoculation, when collected and examined, a- 
gain presented a similar picture. Analysis of covariance showed 
processing order independent of inoculation order to still be the cul- 
prit. (Table III). 


Continuing we next randomized the order of kill from rack to rack 
maintaining the routine procedure in all other parts of the assay. 
Analysis of the results still showed rack differences but of a different 
nature than before. (Slide 5). Am analysis of covariance on processing 
order no longer showed improvement. (Table II). There was no signifi- 
cant processing order effect when independent of the order of kill. If 
we performed an analysis of covariance on order of kill independent of 
the routine order of the rest of the assay we could explain the rack 
differences. Graphically this picture, where we consider order of kill 
instead of processing order, would have a picture similar to before 
(Slide 4). Here was the root of our trouble, the order of kill. 


To round out all factors, the order of reading was next randomized 
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and found to be of no importance as a contributing factor. 


As a result of these experiments, it was concluded that the order 
of kill was the major factor of interest. Why should this be so? It 
was realized that once started, the length of time an organism continued 
to multiply was not only during incubation but also during the period 
between removal from the incubator and the addition of the formaldehyde. 
This could be from 0 to 10 minutes progressing steadily from the first 
to the lask rack killed. This was at a time when growth was progressing 
at a rapid rate whereas during inoculation the growth was progressing 
at a slow rate. 


How might this time difference be equalized from rack to rack? If 
our reasoning is correct, this is what is necessary to produce a suit- 
able assay. This time equalization must be done in a manner that will 
lend itself to routine usage. There is no sense finding a solution if 
we can't implement it. 


Let us return to the set-up used at the start of the assay (Slide 
3). We have duplicate tubes adjacent to each other in each rack. What 
purpose do they serve? They simply provide a measure of variation of 
the technician and machine from tube to tube. This is of little im- 
portance to us as far as assay error is concerned. Samples are con- 
tained in different racks so rack differences are of principal impor- 
tance when considering assay error. It is rack differences that must 
be minimized as much as possible. 


The procedure after the tubes are prepared, is to inoculate these 
tubes using an automatic pipetting machine so the time interval is 
constant between tubes. The killing step also has equal time intervals 
since an automatic pipette is used. 


Now if we consider instead of inoculating or killing in the routine 
manner, inoculating one tube of each pair of duplicates starting from 
the first rack and going through the last rack and then return inocu- 
lating or killing the second of each pair from the last rack through 
the first rack, the average time of inoculation and the average time of 
kill will be the same for each pair of tubes. 


This can be made clearer if we order our tubes in the racks in the 
manner of slide 6. This procedure will not only equalize the time from 
rack to rack, the point we have been trying to achieve, but also will 
equalize the time for each pair of tubes. If our reasoning is correct 
this should minimize within rack differences as well as between rack 
differences thereby eliminating all bias since the average time during 
which growth can occur will be the same for all pairs of tubes. Of 
course the variation between the so called duplicate tubes will in- 
crease. This source of variation can then be corrected for the revised 
processing order effect by analysis of covariance to give an estimate 
of technician precision. 


If successful we will wind up with a systematic procedure which 

1) is suitable for routine purposes 

2) requires no extra work 

3) has simple potency calculations from the sample reading 
averages 

4) has little likelihood of being confusing 
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5) and provides a valid estimate of the assay error, 
This procedure was tried first by using the revised ordering during the 
kill step only. The results on analysis are in Table VI. Graphically 
they plotted as in slide 7. There is no longer any downward trend of 
the averages and there is no significant differences between racks. 
This was exactly what was desired. All rack to rack bias was eliminated. 


It was then decided to consider this revised ordering for both 
inoculation and kill. This would provide a systematic ordering from the 
beginning to the end of the assay. The results of this experiment were 
so promising that a second verification run was made and the experimen- 
tation concluded. Slide 7 is a graphical picture of the results of one 
of those experiments. 


To estimate technician precision an analysis of covariance with the 
revised processing order was performed. Essentially this is done by 
determining the variation of each individual measurement from a linear 
regression line passing in order from the first to the last tube proc- 
essed at each concentration. 


In all cases the magnitude of the error between duplicate tubes, 
corrected for the revised processing order, was no greater than the 
concentration by rack interaction originally present. This corrected 
error term was now a valid estimate of the assay error and there was 
no longer any significant bias to affect potency calculations. It was 
now a simple matter to calculate the confidence limits originally 
requested. The 95% confidence interval was found to be +5.5%. 


Implementation of the Solution: 





With six operators to be taught the new technique, supervision ex- 
pected to have to throw away a few days work, eat least, before a suc- 
cessful assay could be completed. However, after the people had been 
given a full explanation of why the change was being made, the revised 
procedure was put into regular operation the first day. From then on 
it was a success. Those of you who are acquainted with microbiological 
assays will agree that a 95% confidence interval of +5.5% is extremely 
good for this type of method. 


This technique has since been applied to other turbidimetric micro- 
biological methods in the same laboratory and the application has been 
completely successful. 


At this time I would like to cite a remark the well known statis- 
tician, R. L. Anderson, made at a conference in November, 1956; "If 
the experimental units do differ widely on some measurable character- 
istic, x, it may be desirable to use a balanced assignment plan, so 
that the average value of x is the same for each treatment, and then 
use an analysis of covariance to obtain an unbiased estimate of the 
error variance. I predict that you will hear more of this procedure 
in the future." The study I have discussed was made in January, 1956 
and we had been using this procedure for almost a year before Anderson 
made his remark. You will hear more of this technique in the future! 
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KEEP IT SIMPLE - - IT PAYS 


Axel U. Sternlof 
American Steel and Wire Division 
United States Steel Corporation 


Many of our well established industrial enterprises have learned of 
the near-fabulous mushroom growth of statistical quality control as ap- 
plied to manufacturing processes. They have seen stories in the various 
trade journals of the thousands of dollars that have been salvaged from 
the industrial junk-heaps as well as of the tremendous improvement that 
has been made in product quality through the use of this technique. They 
have heard of what can be accomplished through the use of statistical 
techniques involving correlation and regression; they have also heard 
of the fantastic savings through statistical design and analysis of ex- 
periments. Consequently, these modern management tools have been looked 
upon as & veritable panacea for all manufacturing ills and a great many 
manufacturers have been guilty of putting "the cart before the horse”. 
The application of these so-called advanced techniques of quality con- 
trol to manufacturing processes unquestionably has its place in our 
every day industrial life, and intricate problems are solved in a com- 
paratively short time. But what about the basic or the fundamental pro- 
cesses? Are they in a state of statistical control? If so, is that 
control satisfactory in relation to the tolerances or the specifica- 
tions? In many cases it is the application of the simple or fundamental 
techniques of quality control that reveals the gold in the industrial 
mine. 


Let us take a look behind the scenes of an electrical wire and 
cable plant. Here we find a typical job shop with hundreds of different 
products, using many different materials, frequent set-up changes, with 
few product duplications, many of them being manufactured to trade speci- 
fications, to customer specifications, etc., which frequently are vastly 
different. 


Then, too, we have the situation where we must judge the whole pro- 
duct by what is evident from an examination of end conditions only. We 
cannot cut a cable in the middle to check its characteristics. However, 
in spite of all these variables, there are applications of quality con- 
trol that can be used to pay off handsomely. Let us look into this situ- 
ation and see what we can learn. 


First, we have the drawing of the copper wire to the tolerances 
specified by the Product Engineering Department. I have selected one 
size out of our actual experience in the plant to illustrate how the 
dividends materialize. This particular wire was drawn to the tolerances 
of 0.0122" to 0.0123" and had been processed to this specification for 
years. Average and range charts were kept on the diameter and the physi- 
cal characteristics such as per cent elongation and tensile strength. In 
the matter of this presentation we are interested solely in the diameter 
of the wire. Charts for this particular eharacteristic showed very good 
control. Now and then a point on the average or the range chart would 
go out of control as the drawing die deteriorated. 


Shortly after the installation of quality control in the plant, we 


had an inquiry from the engineers in relation to the diameter of this 
particular wire. It was explained a customer had complained on a product 
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in which this wire size was used. The Quality Control Organization ws 
requested to look into the matter and after a detailed survey which took 
into consideration every variable that entered into the process, we con- 
centrated on the wire diameter. Our original data sheets were available 
for all characteristics for several hundred tests and after exoneration 
of all the other variables we went back to study the diameter of the 
stranded copper conductor. On investigation we found that the specifi- 
cation called for a seven-wire stranded conductor with a nominal diam- 
eter of 0.0360" and tolerances of 0.0351" to 0.05369". In a seven-wire 
concentric stranded conductor, that simply meant that each wire should 
have a nominal diameter of 0.0120". When the data was charted for the 
stranded conductor diameter the interesting situation as shown in Figure 
#1 was portrayed. This chart represents 100 reels of stranded wire, the 
individual measurements being arranged in groups of 4 for control chart 
purposes. Figure #2 shows our quality report to the engineers which 
gives a very good comparison of product versus specifications. The high 
"per cent above specifications" does not mean bad product. It does show, 
however, that since the product is sold by the foot, we were giving away 
copper. Continuing our investigation it was disclosed that at some time 
in the distant past the tolerances of 0.0122" to 0.0123" had been stipu- 
lated as necessary in order to produce a finished wire with the nominal 
of 0.0120". The extra 0.0002" approximately were added to take care of 
"pull down" in the subsequent processes of tinning and stranding. It 
was plainly evident from hundreds of measurements made by the Quality 
Control observers that this allowance was not necessary. New tolerances 
of 0.0119" to 0.0121" were finally adopted, which m & sav 

42 in copper. This saving was very easily translated into dollars and 
cents for a report to management. This was merely one of many similar 
illustrations of factual data proving conclusively that opinion can be 
wrong. 


To complete this picture let us look at Figures #3 and #4 which 
show the results of the process after the change had been put into ef- 
fect. Figure #5 is a chart showing one-half of "before and after" 
charts for comparative purposes. 


For the second half of this presentation let us compare two sam- 
pling plans; the first is illustrative of a rather complex plan; the 
second, of a simple plan. We will compare the amount of inspection in- 
volved under each plan and the end results of both. The product in- 
volved is an electrical wire known commercially as "building wire", the 
insulation being of the type known as thermoplastic and is in the 2/64" 
insulation thickness class. 


The inspection station is located at the automatic coiling machine 
where a reel of wire of several thousand feet in length is automatically 
coiled and cut into coils of 500 feet. For the purpose of this compari- 
son each reel is considered a "lot" and the size of the lot is dependent 
on the number of coils obtained from the reel. As each coil comes from 
the machine, a piece approximately a foot long is cut from the end of 
every coil. It is from these end pieces that we are to select our 
samples at random, measure the characteristic, and determine the fate of 
the lot under both of these inspection plans. When the sample has been 
selected under the first plan and the individual pieces have been tested, 
all the pieces are then cut to the same length and the sample selected 
at random for the second plan and the pieces tested again. In this man- 
ner there will be no bias or prejudice in the selection of the various 
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samples. 


For this comparison, the lots are grouped into sets of 25 reels 
each and we have 12 such lots representing a grand total of 4,954 coils. 
Furthermore, the bare conductor in these lots varied in size and type, 
such as solid and stranded. The common criteria here are type and 


thickness of insulation. 


The plan which I refer to as being rather complex is under con- 
sideration by a committee from the I.P.C.E.A. (Insulated Power Cable 
Engineer's Association) and is styled as Project #356. The following is 
quoted from the plan: 


"Section 2, Minimum Wall as Criterion 
" seeeeeee The normal sigma for a manufacturer entering this plan shall 
be the 100% column. 


— 6 6 ® 


"Section 4, Sampling Plan 
"See Tables I and II 


bl ctions For U Variable in, 


"For given type of insulation and lot size, take sample of size of 
first value given in cumlative sample size column. If the average of 
the values in this sample is equal to or above the correspending lowest 
acceptable average given in the proper sigma colum, the lot is accepted. 
If the average is below this value, a second sample is taken such that 
the total sample size is equal to the second value given in the sample 
size colum. If the average of the total sample is equal to or above 
the corresponding lowest acceptable average, the lot is accepted. If 
the average is below this value, the lot is rejected. 


m t or Cha 


"If the ranges of four consecutive lots exceed the corresponding 
value, the plans for the next higher sigma must be used. 


"If the ranges of twenty consecutive lots are all equal to or less 


than the corresponding R, value, the plans for the next lower sigma may 
DO USSM. cccccccccccce ." 
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Table I (In Part) (1) 
Variables Sampling Plan 


Insulation lot Cumulative Lowest Acceptable Average Absolute 
Size Sample In Mils Minimum 
Size Sigma Equal To, In Percent Mils 
50 75 100 125 








Thermoplastic 
31 Mils In 
Thickness 1 - 3* 2 27.0 28.3 29.5 30.8 
+ 26.7 27.8 28.9 30.0 
4-10 2 27.0 28.3 29.5 380.8 o6 
4 26.8 28.0 29.1 30.3 
11 and 5 27.1 28.4 29.7 31.0 
Over 10 27.0 28.2 29.4 30.6 
*Each coil is tested separately taking one (or two) 
pieces from each end of coil. 
Table II (1) 
Requirements for Changing Sampling Plan 
Insulation Lot Cumulative Sigma Equal To, In Percent 
Size Sample 50 75 100 125 
Size = Ry OR Rp Re Rp OR Rp 
Thermoplastic 
31 Mils In 
Thickness 1 - 3* 2 = 3 $s 3 4 - 6 
os 3) hCU- 4 4 5 6 ~ 8 
4-10 2 1 - e §& 3 4 - 6 
4 306hCt- 4 4 5 6 - 8 
11 and 5 S$ - 4 4 6 6 - 8 
Over 10 4 - 5s § 7 7 - 10 


*Each coil is tested separately taking one (or two) 
pieces from each end of coil. 


So much for the details of the I.P.C.E.A. Project #356 plan. The 
so-called simple plan is akin to that recommended by the ASTM Task Force 
in their tentative specifications for the control of copper wire physi- 
cal properties such as per cent elongation and tensile strength. This 
plan suggests that, regardless of the lot size, a random sample of four 
is selected for the measurement of the characteristic involved. I feel 
that it is imperative to quote from the notes in the A.S.T.M. specifica- 
tions for "assurance" that the plan will work; "Note 5. Specifications 
for Hard Drawn Copper Wire B 1 - 53T. 


"Cumulative results secured on the product of a single manufacturer 
indicating continued conformance to the criteria, are necessary to in- 
sure an over-all product meeting the requirements of these specifica- 
tions. The sample sizes and conformance criteria given for the various 
characteristics are applicable only to lots produced under these condi- 
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tions" (Ref. 2). The intention from the quote is Oe: plain; namely, 


t a control ¢ verage and faithfu a big as- 
set leading toward quality assurance. The A.S.T.M. ~~ states that a 


second sample be taken under certain conditions. However, even though a 


second sample may be required at times, the conformance criteria remain 
the game as far as the characteristics are concerned. 


In order to conserve both time and space, the details of the many 
data sheets are not reproduced here, a summary being used in its stead 
for the comparison of the two plans under consideration. Figures #6 
and #7 might be typical of the sort of control chart that might be used 
under the I.P.C.E.A. plan. The solid lines shown on the charts are not 
control limits; they are the criteria limits that are determined by the 
sigma column and sample size that might be in use at the moment. 


Let us refer to Figure #6. This shows pictorially the data of the 
first 25 lots of reels inspected. Note in particular that the range of 
the first 20 lots in succession is better than that called for under the 
100% sigma column, and note,also, that two lots stand rejected. Conse- 
quently, at lot #21 we are entitled to inspect under the criteria found 
in the 75% sigma column, which criteria will now accept the two lots 
that were rejected under the 100% sigma colum. Figure #7 shows the 
performance of the second 25 lots and note that one lot stands rejected 
under the 75% sigma colum criteria. During the inspection of the third 
set of 25 lots, we were entitled to go to the 50% sigma column, the low- 
est of the plan, and now we can pass the lot that was rejected under the 
75% sigma column. 


As we examine the plan and the charts shown here, I would like to 
call your attention to the following facts: 

1). In quality control work, it is quite the customary practice that 
once a process has established itself, control charts, with limits 
as previously determined or set by the process itself, are placed 
into operation in the plant. Under the IPCEA Project #356 no such 
limits are possible: only specification minimum can be shown and 
these can be located on the chart only after the actual testing. 
The reason for this is that the minimum specification can be deter- 
mined only by the final sample size and the particular sigma colum 
being used at the time. HERE IT IS THE PLAN, NOT THE PROCESS, THAT 
DETERMINES THE LIMIT. 


2). The criteria of both the minimum wall thickness and the range are 
not fixed, but are determined by the sigma column and the sample 
size used at the moment. 


3). It is my understanding (from a member of the Project Task Force) 
that this plan is not intended to be used as a control for manufac- 
turing processes but, rather, as a plan for the inspection of in- 
coming material. 


If this be the case, I can imgine a vigorous protest from those 
manufacturers whose processes are controlled at the criteria of the 50% 
sigma column, only to see that very same product rejected by the custom- 
er who inspects under the standards found in the 100% sigma colum. This 
plan then penalizes the manufacturer who has learned to control the pro- 
cess with very low variation in the minimum wall thickness. I am not 
contesting the statistical soundness or validity of the plan for it is 
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not the purpose of this paper to enter into the arena of academic argu- 
ment. I do, however, seriously question its practicability from an in- 
dustrial viewpoint and object strenuously to its required use because 
the plan would increase inspection costs considerably. Such increase 
must be passed on to the customer in the price he pays for the product. 


Let us now compare the results under the two plans. I wish to 
point out that the measurements were made by using a pin gauge and re- 
cording these to the nearest .0001". Averages were carried out to six 
decimal places and no "rounding off" took place until this final summary. 
Three hundred lots or 4,954 coils were sampled under each plan. The time 
period covered by the data for this comparison is not confined to a 
week's run, but was purposely spread out over several months. We mst 
realize that the "over-all" distribution will best be approximated if 
the data under consideration covers many operators, machines,set-ups, 
etc. Today's run may be at the lower portion of the over-all distrib- 
tion curve and next week we may be operating at the upper portion. To 
estimate the standard deviation from the data obtained from any one set- 
up or run am translate this information as being typical of the process 
over a length of time might lead to false conclusions. 


Plan Coils = ; = t = ’ 
Tested x Oo X - 30 X+ 30 

IPCEA 1881 .0297 . 001057 0265 .0329 

n=4 1220 .0300 .001036 0269 .0331 


In order to calculate the standard deviation overall, it was neces- 
sary to revamp the data under the IPCEA plan into sample sizes of 2, 4, 
5 and 10 pieces, make the calculation for each sample size and then 
weight the totals for the final results. This standard deviation overall 
(0,) calculation is the same as that used by the A.S.T.M. Task Force and 
takes into consideration the following: 


1). The standard deviation of the averages ...........-. etened Oo 


2). The standard deviation within the groups ....ssccceseceecs o 


$). A correction factor (n = sample size) .......sssee0s: ree Oy 
4). The standard deviation among the groups ........ pésenuden , e 
5). The standard deviation over all .....sscccccccccccccsseces o, 


The formula for this calculation is as follows: 


yi4d-e>: f ° 0, 
“Ti 


Applying the above formula and calculating the standard deviation 
overall for the data of the two plans, we have: 





Plan Coils = 
Tested x v, X - 30, X + 30, 

IPCEA 1861 -0297 -001228 .0260 -0334 

n=4 1220 .0500 .001449 -0256 -0345 
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A word of further explanation might now be in order as we compare 
the above results. It seems evident that the "100% sigma” column of the 
IPCEA plan is the average obtained under the industry-wide survey con- 
ducted by the Project #356 committee. The "absolute minimum" of 23 mils 
stated in the plan could be the industry-wide average minimum wall 
thickness minus three standard deviations. In quality control language 
we say that this value could occur atout three times in one thousand. We 
must remember that this survey was made from data submitted by many pro- 
ducers testing their own product which was manufactured to meet the 
present trade specification of .028" or 28 mils as a minimum wall thick- 
ness. 


From the above tabulation, we can see that the results obtajned 
under both of the sampling plans are practically identica]. The only 


significant difference between the two plans is indicated in the rela- 
tive amounts of inspection necessary to obtain these identical results; 
the IPCEA Project #356 required approximately 50% more Testing than did 
the AST n = 4. 

So from this comparison it is quite evident to see that to 


"Keep It Simple - It Pays". 


References: 
(1) IPCEA Project #356 


Second Draft April 14, 1955 
"Dimensional Tolerances". 


(2) AST Standards on Metallic Electrical Conductors 
October 1954, page 335, Note S. 
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Figure 2 


QUALITY CONTROL REPORT Strand Diameters 


b—-- -- ---- 





Process Center 20373" 
Process Capabilities 0361" - .0385" 


LSL = Lower Specification Limite 0351" 
USL = Upper Specification Limit-  .0369" 
5 = Mean of Specifications= 20360" 
R = Mean of Observations 20373" 
CO = Standard Deviation= .000388 
X1-XR-LSL- .0373 - .0351 = .0022 
X2 = USL - K+ .0369 - .0373 = .0004 
Cc -X -®- 20373 = .0360 = ,0013 
off-center relative to mean of specifications. 
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vonu teres Figure #4 
QUALITY CONTROL REPORT Strand Diameters 
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Process Center 20360" 
Process Capabilities 0355" - .0365" 


LSL + Lower Specification Limite  .0351" 

USL + Upper Specification Limit- 0369" 

Sv? = Mean of Specifications=- 20360" 

R = Mean of Observations- 20360" 

O''= Standard Deviations 000175 

X1-K-LSL- 0360 - .0351 = .0009 

X2 = USL-X%- 0369 = .0360 = .0009 

Cc -X -®- 20360 = .0360 = 0 Amount production is 
off-center relative to mean of specifications. 


ee = 51, Year Average Sed. Dev. *eBS 
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RECENT RESEARCH ON STATISTICAL PROBLEMS IN SUBJECTIVE TESTING 


Ralph Allan Bradley 
Virginia Polytechnic Institute 


1. Introduction 


There is widespread interest in the design, conduct, and 
analysis of experiments involving the subjective opinions of 
samples and panels of individuals. Applications arise in 
food processing, photography, distilling and brewing, textile 
research, wood technology, petroleum products research, and 
in a host of other areas of research. 


Many statistical problems occur in subjective testing 
that require further attention. We shall present here the 
results of recent research, with illustrative examples, on 
techniques that deal with the sensitivities of scoring scales 
and with the variabilities of judges or respondents using 
scoring scales. While these methods relate to the use of 
scoring scales as they are widely employed, we also remind 
the reader that it may often be desirable to employ simple 
ranking methods in subjective testing. 


2. Sensitivity Comparisons 


In the development of scoring scales and other experi- 
mental techniques, it is often desirable that two alternative 
methods be compared. Cochran [1] discussed the comparison of 
different scales of measurement for experimental results and 
indicated where further research was required. We have pro- 
vided means of comparing the sensitivities of similar experi- 
ments in two recent papers (Schumann and Bradley [2], Bradley 
and Schumann [3]). This recent research permits a test on 
the equality of the parameters of non-centrality of F-dis- 
tributions associated with tests of treatment equality in two 
independent but parallel experiments containing the same set 
of treatments in identical experimental designs. The experi- 
ments may differ in the scoring scale used or in some other 
criterion of measurement that does not interact with treat- 
ments. 


Good experimental data to illustrate the method to be 
presented in this section were published recently by Kauman, 
Gottstein, and Lantican [4] who were interested in the qual- 
ity evaluation of dried veneer. They used two evaluation 
schemes: "numerical" with a zero-score being excellent and a 
score of fifty being very bad and based on assigned scores 
for various types of degrade, and "subjective" with quality 
appraisal ratings from zero, excellent, to eight, very bad. 
Twenty selected sheets of veneer were evaluated by three ob- 
servers, twice with each scheme with repeat observations 
spaced by several days and orders of presentation of sheets 
changed. Complete tables of scores are given in the refer- 
ence; we require the summaries of Table 1 below. 
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Table 1 
Analyses of Variance for Quality Ratings* 






































Degrees |Numerical scheme|Subjective scheme 
Factor of 
Sum of Mean | Sum of Mean 
freedom| squares; square] squares| square 
Sheets (S) 19 12826.16| 675.1 336.90 17.73 
Observers (0) 2 170.72| 85.36 3-70 1.852 
Repetitions (R) 3 168.13 56.04 0.61 0.2042 
Interaction(SO)| 38 823.61} 21.67} 30.12 0.7928 
Error (SR) 57 595-37| 10.45| 22.13 0. 3884 
* 


A reproduction of part of Table 6, Kauman, Gottstein, and 
Lantican [4], page 148. 


We apply our method to compare the sensitivities of the 
two evaluation schemes, the test being one to ascertain 
which of the two schemes is more sensitive for the exhibi- 
tion of quality differences among the sheets of veneer. De- 
tailed discussions of the procedure used are given in the 
cited references, and we here illustrate the procedure only. 
We restrict our interest to the twenty sheets of veneer in 
the experiment and assume Model I of the analysis of vari- 
ance. 


The expected value of the mean square for sheets is 


t 
(1.2) BM.S.(S)] = 0? + k E t2/(t-1) 


1 
in general for t sheets and k observations on each sheet. 
Ty is the "effect" of sheet i, i =1,...,t. In the examples, 


20 
E([M.S.(S)] = 02 + 6 £ 12/19. 
i=1 + 


(2.2) 


o* is the expectation of the error mean square in both (1.2) 


and (2.2). The parameter of non-centrality of the F-test 
for sheets is, in general, 

t 
(3.2) N= kf t#/202 

i=1 1 


and, in the examples, 
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; 20 
(4.2) mR = 6 £ 2/202 
i=1 * 

when the F-density is written 


(5.2)  £(F) = (a/b)®{B(a,b) te F8-1 (1+aR/b)~ (a+b) 
* ,F [atb, a, adF/b(1+aF/b) J], O< F < 


where F has 2a and 2b degrees of freedom, ,F, is the conflu- 
ent hypergeometric series, and B represents line beta func- 
tion. A is a parameter expressing the magnitudes of treat- 
ment effects in a scale in terms of the experimental error 
associated with the scale. X is the appropriate parameter 
to measure the sensitivity of a scale. 


We shall test the hypothesis, Hp: \ = %5, against the 
alternative, H,: # \», using the subscript 1, for the 
numerical scheme, and 2, for the subjective scheme. 


To apply the test, we compute the two F-ratios with 19 
and 57 degrees of freedom (Now a = 9.5, b = 28.5.) and ob- 
tain 

> 64.60 and Fo = 45.65. 


The statistic used is 
(6.2) w= F)/Fo = 64.60/45.65 = 1.42. 


The distribution of w under H, depends on nu. = = No which 
in general is unknown. In practice it is clear thatthe 
test is not very sensitive to small changes in X and we, in 
fact, estimate %\ from the data using 


A 
(7.2) Ns 
t 


In the examples, 


a(F;-1), i=l, 2. 


A 
and 
A 
No > 9.5(45.65-1) oad 424.2. 


A A A 
We take X% to be the average of Me and Nos 


(8.2) XR = 3(604.2 + 42h.2) = 514.2. 


145 








A table of values wo such that P(w > wo|H,) = .05 is given by 
Bradley and Schumann in the cited references. To enter this 
table, one requires 


(9-5 + 514.2)#/(9.5 + 1028.4) 
264.2 


(9.2) a’ = (at+dX)2/(at2n) 


i} 


and b = 28.5. The table is symmetric in the sense that 
wo(a’,b) = wo(b,a"), and we obtain w, 41.85 by consulting 
the table. Row H., aS postulated, is two-sided and hence the 
significance level being used is .10. win (6.2) does not 
exceed w,, and consequently we do not reject H, at the 10 per 
cent level of significance. We are in accord with the auth- 
ors (Kauman et al.) who state "the present experiment has 
shown that the subjective evaluation can yield results of an 
accuracy approaching that of the numerical scheme, although 
the accuracy of the latter was slightly superior." 


Applications are limited since a table is only available 
for a one-sided 5 per cent level test. Schumann is preparing 
additional tables that are nearly complete. 


The procedure may also be applied to compare multiple 
correlation coefficients under certain conditions and an ex- 
ample is given in reference [3]. 


3. Judge Variability and Judge Comparisons 


When items are scored in subjective experimentation, 
there is no knowledge of the "true worth" of the sample in 
the units of the scoring scale. It is then difficult to as- 
sess the judging ability of a judge. Russell and Bradley [5] 
have provided means of estimating the variability of a judge 
in terms of the deviations of his scores for an item from 
those of the remaining judges but permitting a judge a possi- 
ble constant bias in his assignment of scores. Similar pro- 
cedures were considered by Grubbs [6] and Ehrenberg [7]. 


Consider a two-way classification with t items or treat- 
ments and r judges. The model with fixed effects is 


(1.3) = pt ty + By + Gy, i= Leoeyt, J = leoeyr 


Vij i 

where Vij is the score assigned by the jth judge to the jth 
item, j is the grand mean, the average level of judging, Ty 
is the effect of the ith item, 8; is the effect (or bias) 

of the jt" judge, and €;4 are independent normal variates 
with zero means. Contrary to the usual model of analysis of 
variance, we admit the possibility of heterogeneous error 
variances in the sense that 
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S45 «al 
(2.3) B(6},) 04, 


a is the variance of the jth judge and is to be estimated. 


The estimator of oF to be used is 





A rG, E 
(3.3) o2 = J 
j (t-1) (r-2) (t-1) (r-1) (r-2) 

where 

. 2 
(4.3) G; - as -%., * ye 5 2,3 
and 

t 7 . 
(5.3) E = a. PA ~The * y.j + Pe ’ 


the latter being the error sum of squares from the analysis 
of variance of the two-way classification. &% is an unbiased 
estimator of o% but, like an estimate of a vatiance compo- 
nent, may occationally be negative. In (4.3) and (5.3), yi. 
is the average of scores for, treatment i, y.; is the average 
of scores assigned by the je judge, and y,.“is the average 
of all scores. The requirement that &;j in (1.3) be normal 
is only met approximately in use of a discrete scoring scale 
but does not affect the estimation of o%. In later para- 
graphs of this section, we shall assume*that departures from 
normality do not seriously affect our test procedures. 


We shall again illustrate this work using the data of 
Kauman et al. The detailed example is for Test 1 using the 
subjective scheme. Scores are listed in Table 2. In Table 3 
we show values of the residuals, (yj; - yy.. - he mS 
Values of G; and E are given in the lower margin of Table 
and are obt&ined by accumulating the squares of entries in 
the columns above as required in view of (4.3). E= £3265 
and was so obtained. The values of $2 computed using ” 
(3.3) are listed in Table 4 along witH those for the other 
three tests of Kauman et al. To illustrate the computations, 
we use observer A and obtain 


_ 3(8.36) _ __22.50 
(19)(1) (19) (2) (1) 





A> . 
oF = 0.72. 

A test of homogeneity of variances is possible only when 
r= 3. The only situation wherein the estimators $*% of 02 
are maximum likelihood estimators is when r = 3 and*then adn 
approximate test may be made. Consider the hypothesis, 
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Table 2 Table 3 


























Quality Ratings for the Values of (yj icYi .-Y. ity...) 
Geujoctive Gialicy, for she, sunpebeste Gulnsty 
Observers Observers 
=| & BB Cl Ms, —") & B c 

1/ 3 3 3 }3.00 1] .19 -.09 .09 

2} 7 5 7 [6.33 2] .86 -1.42 .58 

3 6 5 5 15-33 3 ~86 -.42 -.42 

h} 7 8 7 17-33 ho | -.1h =. 58 = 2 
5/1 2 3 42.00 $1 +.81 «=.09 92 
6/5 5 5 15.00 6} .19 -.09 ~-.09 

7 5 6 5 15.33 7 | --14 258 9-042 

8 3 5 4 |4.00 8 | -.81 91 -.09 

9} be5 5 5 |e 83 9] -.14  .08 .08 
10| 6 7. 7 {6.67 10 | -.48 2 2 
11 | 5 & = & }he33 1l | .86 -.42 +242 
12} 8 7. 8 17.67 12 | .52 -.%6 2 
13 5 7 5 15.67 13 | -.48 1.2h -.76 
mw] 2 #2 = 2 1as67 th | --4h8 «2h 0 020th 
15| 7 #7 #427 47.00 15 | .19 -.09 -.09 
146) 1 #3 & 12.67 16 |-1.48 .2h 1.24 
17 4 3 3 13-33 17 -86 -.42 -.42 
18 6 6 5 |5.67 18 ~52 -2h -.76 
19} 5 5 7 1567 19 | -.48 -.76 1.24 
20} 3 3 2 12.67 20'| .52 .2h -.76 
Yoj |he62 4-90 4.90}7* 5, G; | 836 7.07 7.07 



































E=22.50 


_" Table 3, Kauman, Gottstein, and Lantican [4], page 
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Table 4 


Estimates of Variance and X§ to Test for Homogeneity of 
Observer Variances for All’ Four Tests of Kauman et al. 











Observer Error Mean 2 
Tests Variances, a2 Square, Re . 
A B C 
Subjective Test 1 72 53 53 59 elk 
Subjective Test 2 46 58 - 7h 59 + 26 
Numerical Test 1 | 4.58 10.79 33.02 16.13 6.42" 
Numerical Test 2 | 2.19 26.40 21.69 16.84 4.14 




















e 2= 2 = g2 
H,: oF o5 o3 


and the alternative, 
H,: oF # oF for some j and k, j, k= 1, 2, 3. 


The likelihood ratio test statistic, distributed approxi- 
mately as a X*-variate with 2 degrees of freedom for large 
samples, is 


(6.3) X2=-(2. 3026) (t-1)[2 log(t-1) + log (8263 + 8262 + & 
- 2 log E + log 4/3] 
= -(2.3026)(19)[2 log 19 + log {(.72)(.53) 


+(.72)(.53) + (.53)(.53)} - 2 log 22.50 + log 4/3] 
=0.14. 


The multiplier, 2.3026, in (6.3) is included so that common 
logarithms may be used in the computation of X*. The small 


value of X* indicates that the observers may be taken to have 
homogeneous variances. 


In Table 4, we have included values of X2 for all four 
tests and show also values of 0*, the error méan square from 
the analysis of variance. Note that only in one of the nu- 
merical tests was x significant at the 5 per cent level of 


significance. The estimates of variance in the numerical 
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scheme are considerably larger than in the subjective scheme. 
This does not of course suggest a preference for the subjec- 
tive scheme but is merely a result of the scales used in the 
scoring methods. The appropriate method of comparing the 
scales is the one given.,in the preceding section. 


Observer A was the only observer with previous experi- 
ence in judging veneer except for brief training sessions be- 
fore the experiment began. Another test, and this is an 
exact test, is possible. Consider the null hypothesis 


H of = o*, given of =... = a2 = 92, 


°° 
and the alternative, 


H,: of < o?, given of = ... = o2 = 0%, 


a® % 2 


the statistic used is 
r(r-2)G, 


(763) ‘ (r-1)E-rG, 

with (t-l1) and (t-1)(r-2) degrees of freedom. H, may have 
either of the possible one-sided forms or be two-sided. For 
the form of H, shown, small values of F are significant. In 
the example, fhere is no point in testing Hy versus H, in 
this test in view of the homogeneity of variances demon- 
strated above. However, we shall proceed in order to illus- 
trate the method. 





— (3) (1) (8. 36) 


~ 2) (22-50)-3(8.36) ~ 2°27 


With 5 per cent level of significance, we obtain the critical 
value of F by reading the tabular value of F with (t-1)(r-2) 
= 19 and (t-1) = 19 degrees of freedom from an F-table and 
taking its reciprocal. In this case, values of F < 1/2.13 = 
-47 are significant at significance level .05. This F-test 
is useful, for example, in considering the addition of a new 
judge to an established taste panel. Values of F with their 
upper and lower bounds for significance may be plotted to 
yield a rough control-type of study on the performance of a 
given judge in successive experiments. The fact that other 
judges do not necessarily have homogeneous variances should 
not seriously affect the procedure. A form for F, alternate 
to (7.3), may sometimes be preferred. This form is 


iy E+(t-1) (r-1) (r-2)8 
* rE-(t-1) (r-1)9? 





(8.3) 
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4. Discussion and Summary 


We have discussed two procedures that are of use in sub- 
jective testing wherein scoring scales are used. One deals 
with the establishment of suitable scoring procedures and ex- 
perimental techniques. The other permits an evaluation of 
the performances of judges using scoring scales and permits 
plotting of charts showing judges’ performances over series 
of experiments. We now note some additional procedures use- 
ful in subjective testing. 


Often in subjective experimentation, the items to be 
compared result or should result from factorial treatment 
combinations. But subjective evaluations may require the use 
of incomplete block designs. Kramer and Bradley [8, 9] have 
provided means of using factorial treatment combinations in 
incomplete block designs and this may often be useful in 
taste testing and related areas. An application of these 
methods is included in these proceedings (H. M. Hill and D. 
Wheeler [10]). 


Ranking methods are often used in sensory testing and 
may provide very efficient means of experimentation. Paired 
comparisons are used most frequently and we cite references 
{11, 12, 13, 14, 15]. Pendergrass f16] in work still unpub- 
lished has extended the models for paired comparisons to use 
in triple comparisons. Some gains in efficiency result in 
certain situations and his procedures also merit considera- 
tion. 
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QUALITY ASSURANCE TO THE ULTIMATE CONSUMER 
IN THE TEXTILE INDUSTRY 


J. B. Curley 
American Viscose Corporation 


The textile industry is a most essential industry to our comfort 
and well being. To those not directly concerned with the manufacturing 
aspect of this industry, it frequently comes as a surprise that there is 
an involved operation that goes into producing the carpet we sink our 
feet into, the gracious gown we behold milady wearing or the Bikini 
bathing suit we admire, 


Speaking of figures, we may take a moment to consider a few. The 
carpet we referred to has about 8000 textile fibers in each square inch. 
A fiber 400 miles in length is required to make a single square yard of 
the dress fabric, In each automobile tire we ride on there may be 4000 
miles of textile filaments. The activity of the textile industry con- 
sists of arranging and treating these textile fibers such that the final 
product has the desired properties of serviceability and aesthetic 
appeal, 


As can be seen, the quality of the final fabric or rug depends on 
the properties of the fibers used, the manner in which these fibers are 
put together to form the fabric, and the finishing operations used on the 
fabric. Fibers, fabrics, and finishes of widely differing characteristics 
are required for a dress or a rug or an automobile tire. 


Now the theme of this paper is “how does a manufacturer assure 
quality to the ultimate consumer"? I would like to explain to you how a 
man-made fiber manufacturer takes steps to assure to an ultimate customer 
that the fabric in the dress or the rug is of a functional quality from 
which reasonable service may be expected. 


There are two programs involved. One is an internal one to see that 
e fiber of specific characteristics is manufactured and to control the 
manufacture in order that a uniform material is produced. The other pro- 
gram is in some respects a novel one, A manufacturer of an “intermediate 
product in this vertical textile operation expends effort to see that the 
ultimate consumer receives a satisfactory product. 


In the abstract, a material can be described by its physical and 
chemical properties alone, without reference to what it is made from, how 
it was made, or how it will perform. Ina practical sense, we rely 
rather heavily on the knowledge of how it was made and how it will per- 
form. 


Let me explain how this applies to our industry, We purchase raw 
material against specifications, In addition, we purchase the suppliers 
knowledge of the material. This knowledge includes his description and 
also how it will perform, that is, his recommended processing behavior. 
As a regular check, the supplier sends us a monthly report on certain 
properties of the material he has sent to us. For wood pulp this would 
be percent cellulose, and percent of all other important components. 


Even if the purchaser has complete testing and inspection facilities, 
the manufacturer invariably knows more about a product than can be 
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learned by sampling inspections and testing. Some of the many reasons 
that this situation exists is the frequent inability of the purchaser to 
sample the material as adequately as the mamfacturer. Furthermore, as 
discussed previously, tests that can completely describe the necessary 
properties of a material are either time consuming or difficult if not 
nearly impossible to make, It is for this reason that we use the manu- 
facturers' knowledge of the material which includes the properties known 
to him from his manufacturing information and evaluations, and his 
knowledge of how the material can be expected to perform. 


On all raw materials we make check tests and inspections but these 
checks are only to verify the information with which we have been supplied. 
It requires little logic to see that if material from a particular 
supplier had a history of meeting the specifications, performed satis- 
factorily, and was properly described by the reports of the supplier that 
we would make only occasional tests and inspections on the incoming 
material. On the other hand, if we were not supplied routine reports of 
the suppliers tests to determine the extent to which he is meeting the 
specifications, then we have to carry out a more extensive test and 
inspection program. 


Now to move into the manufacturing operation. Many tests and 
inspections are made throughout each step of manufacture. With a few 
exceptions, it can be said that all of these tests and inspections in- 
crease our knowledge of the final product. These tests and inspections 
enable us to control the manufacturing specification which will produce 
a product of specific properties. This is really the meaning of the 
idiom that properties of a material cannot be inspected into the material, 
they must be manufactured into it. 


It is in fully utilizing these built-in properties that the consumer 
realizes a satisfactory article. To put it another wey, by passing on 
this knowledge of the properties of a product, a manufacturer can assure 
an ultimate consumer that he is getting quality merchandise. 


Again to turn from the general to the specific, American Viscose 
Corporation manufactures rayon and acetate fibers. These products of 
ours are raw materials to our customer. They are further processed be- 
fore they become consumer goods. 


The relationship between buyer and seller that has been covered is 
one that exists between American Viscose Corporation and its suppliers. 
American Viscose feels obligated to carry this relationship on to its 
customers, such that the ultimate product will have the full benefit. 


As mentioned previously, for the customer to get a satisfactory 
textile product, the correct fiber must be used, design and manufacture 
of the fabric must be adequate, and the finish or after-treatment of the 
fabric must be adequate for a specific end use. 


We have dealt here primarily with the manufacture of a correct fiber 
for a specific end use and the approach of passing on the knowledge, 
gained in manufacture of this fiber, to the yarn and fabric manufacturers 
and to the fabric finisher. 


It has too often been the case that even when the correct fiber has 
been used an unsatisfactory garment has resulted due to using the fiber 
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in a sleazy fabric construction or inadequately finishing the fabric. To 
the consumer the fiber is at fault - not the yarn or fabric or the finish. 
No better example could be given than the “explosive” sweaters of a few 
years ago. Farly conclusions by the consumer placed the blame on the 
fiber and not the fabric construction and finish the fabric had received. 


Our obligation of passing on complete knowledge of our product 
encompasses the desire that not only is our customer satisfied with the 
product and service that he gets, but that the ultimate consumer is 
satisfied. To do this we work with out customers, that is, the manu- 
facturers between us and the ultimate consumer. We say to these manu- 
facturers that if materials or fabrics are produced from our fibers such 
that a satisfactory consumer product results that we will assist in the 
promotion and advertising of these fabrics to the ultimate consumer. 
Symbol to the ultimate consumer of our approval of the fabric is the 
Avisco Integrity Tag. To the customer this tag, which is found on the 
fabric, rug, or garment, means that the fabric used will perform as 
claimed. 


In this program American Viscose manufactures the basic fibers. 
Since these fibers are man-made, they can be made with a variety of pro- 
perties, that is, tailor-made for specific end uses. We sel] these 
fibers together with our knowledge as to how they will process on our 
customer's equipment, and our knowledge and recommendation of textile end 
product design. All of this is aimed toward providing the customer with 
a product with which he is satisfied. In doing this we promote the pro- 
duct that carries the label of the Integrity Tag. This means that the 
customer will have faith in the quality and satisfaction obtained from 
textile materials bearing this tag. A friend has been made both for the 
manufacturer of the consumer product and for rayon or acetate made by 
American Viscose. 


Now what does this tag signify? It testifies that properties of the 
materials used have been evaluated and found to meet minimum requirements 
for satisfactory performance for the particular consumer product. These 
requirements obviously vary for the product involved. Fabric service- 
ability in relation to end use is the basis of the Avisco Integrity Plan 
requirements. These requirements of serviceability cover the important 
properties of the consumer product. 


Actually, in the textile apparel and home furnishing, the customer 
decides on an article first on its appearance, its aesthetic appeal. The 
dress has the desired color and shape, or the rug looks nice and again 
its color and shape are satisfactory to the eye. 


Secondly, the customer usually decides on an article by its feel. 
That is, it has the proper handle and "pedal". Permit me the use of 
these words to include feel by both hand and foot. We appraise rugs by 
stepping on them. These two factors in an appraisal of a textile con- 
sumer product are sensory ones. 


After the sensory decisions have been made, the consumer of textile 
materials then is concerned generally with the permanence and restora- 
bility of these values. In detail, these questions could be asked: Is 
the fabric strong enough for reasonable service? Will the strength be 
adversely affected by treatments it may receive? Will the size or shape 
of the fabric change - will it shrink? Are the colors permanent? Will 
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the colors stand exposure to light, rubbing, laundering or cleaning, 

. atmospheric exposure? Will the rug retain its original appearance? Is 
the rug density enough such that long wear and a good springy feel are 
assured? Will the tufted bedspread shed lint? Will the blanket mat dow 
or remain fluffy? When the material becomes soiled, how should it be 
handled to restore the original sensory values? 


In general, we are all capable of making the sensory decisions in- 
volved in deciding on a textile material. On the other hand, how does 
the ultimate consumer evaluate the capability of this textile material 
to retain these values? The consumer can purchase an article strictly 
on appearance and feel with no knowledge or assurance of its service- 
ability and, as such, obtain her first knowledge of permanence as the 
article is used. Sometimes this approach can be quite disastrous as was 
experienced with the "explosive" sweater already mentioned. When the 
textile article is purchased at a reputable store, the customer may get 
satisfaction if she returns the article when dissatisfied with its per- 
formance. 


One way for the customer to know about the fabric serviceability is 
for the customer to learn at the time of purchase how the material may 
be expected to perform and the care that is necessary. The garment or 
rug menufacturer's name label alone creates confidence in the merchandise 
but, unless that label is used on a single product with one end use, it 
is limited in the inference that can be drawn as to the serviceability. 
When this information is known from service or performance testing and 
laboratory testing, then the information can be passed on to the customer 
in the form of a tag or label on the material. Thus by mere inspection 
of such a tag the customer is assured as to the serviceability and in- 
formed as to the care that should be used. Such a program of testing and 
labeling is the Avisco Integrity Plan. By advertising and promotional 
work the customer has the assurance that he has found a reliable guide to 
the purchase of fabrics and textile materials that perform well. 


From a variety of different rayon and acetate fibers we recommend 
to our customers, yarn and fabric manufacturers, fibers of specific pro- 
perties. We make available our knowledge of the fibers and how they 
might be handled such that their properties are most beneficial to the 
manufacturer in making a fabric or textile material. Under our careful 
coaching fabric dyers and finishers improve their know-how of fabric 
quality control. Through a license agreement they are bound to comply 
with the requirements of the plan. 


The Integrity Tag is awarded only to fabrics that have met or 
exceeded test requirements. Among the standards used are the "American 
Standard Minimum Requirements for Rayon and Acetate Fabrics", know as 
ASA 122", published by American Standards Association and recommended by 
the National Retail Dry Goods Association. Use is also made of American 
Standard 124, "Minimum Requirements for Institutional Textiles". This 
standard was developed under the sponsorship of the American Hotel 
Association and is included in their “Hotel Textile Purchasing Guide". 
Fabrics which we are satisfied will pass the test requirements and which, 
in general, are manufactured wholly from Avisco fibers are eligible to 
carry the "Avisco" Integrity Tag or label through every branch of the 
trade and finally to the customer. 


Fabric converters and finishers are required to submit to us samples 
of fabrics that are to carry the Integrity Tag. These samples are 
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tested to see that they meet the above minimum requirements for a par- 
ticular end use. 


Basically there is one point of view, whether as manufacturers or 
as ultimate consumers, we are all purchasers and users. Just as the 
manufacturer buys wood pulp and uses it to make rayon, the customer buys 
a rug and uses it to enhance the beauty and comfort of his home. We buy 
a particular type of wood pulp with the assurance that we can use it 
satisfactorily. The ultimate consumer wants to buy a rug with the 
assurance that it will perform satisfactorily. For varied reasons there 
are limitations as to the customer's ability to appraise the properties 
that mean satisfaction before the material involved is used. In the 
same manner that knowledge of a raw material enables a manufacturer to 
make a product, so knowledge of a consumer product in the form of label- 
ing enables the consumer to purchase with assurance that she will 
receive satisfaction. 
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‘GUIDE TO THE USE OF STATISTICS 
IN THE CHEMICAL INDUSTRY * 


Re Se Bingham, Ire 
The Cartorundum Company 


Introduction 


Before we can decide whether statistical methods are in 
order, and if so which ones, it is important to consider the 
process under studys In chemical manufacturing certain 
characteristics are important to a quality engineer: 


le There is usually a series of steps involving physical 
and/or chemical changes; using 


2e a variety of raw materials from natural products, to 
finished chemicals (ore, sand, clay, eir, coal to gase- 
oline, coke); in which 


32 control requires understanding of 


(a) the properties of materials under reaction 

(b} material handling techniques (barge, pipeline, 
carboy, etce) 

(c) unit operation or process characteristics 

(ad) measurement method used in analytical labse 


Where to Start 





le Most logical place ~ control laboratory 





ae highly repetitious operations 

be need for calibration 

cs standardization 

ad. check on sampling 

@e influence of results on other studies 





2e Most profitable long range ~ research ami development 


@e the inferences made can be as broad as possible 

be experimental results can be compared with outcomes 
of other test programs 

Ce results can be measured against estimates of ex- 
perimental error associated with the test rather 
than some ideal or OS8e¢ 

de results will suggest next areas to studye 


3e Greatest immediate retum = process control 


@e leads to distinctions between natural fluctuations 
and reaction variations accounting for material 
losses or lower yields, etce 


*, more complete paper entitled "Guide to the Use of Statise 
tics in the Chemical Industry" has been accepted by 
Industrial Quality Control for publication. 
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be verifies meaningful “rules of thumb" « 

Ce reduces mumber of "adjustments" necessary 

de focuses attention on what the process actually 
does and which factors control ite 


Power of Statistical Methods 








le statistical "tools" as working implements and means 
to an ende 


2e general versus special purpose tools 
3¢ design capabilities, limitations in use 
he early disappointments 
&@e confidence limits too wide; underestimating experi-~ 
mental error 
be not “curewall" 
Ce don't replace technology or science skill 
S» statistical methods don't create original or final 
conditions, rather 


b) point out size of variability 


ie measure 
c) suggest correlation 


Questions Indicative of Application 


See Tables TI «~ IV 


Table I suggests possible use by orgmizational need, 
Table V lists edditional sources of informations 
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FACTORIAL CHI-SQUARE AS A SEARCH TECHNIQUE 


Ralph F. Huth 
U. S. Steel Corp. 


INTRODUCTION: 


Observations recorded as ratios, percentages, or proportions, are 
commonly referred to as attribute or "all or nothing” type data. Trans- 
forming such data into normalized forms often is time-consuming and re- 
quires considerable statistical knowledge. 


Much of the information collected and recorded during processing 
operations in a steel plant is of the attribute type. Careful exami- 
nation and analysis of this information even though it may be an “after 
the fact" or “hindsight” approach, often leads to interesting and useful 
conclusions. 


The method presented in this paper is aimed specifically at the a- 
nalysis of process data. Granted, it is better to look to the future in 
the form of designed experimentation utilizing controlled data collection 
conditions, but this is not always economically possible, and we cannot 
afford to ignore information which hasalready been gathered. 


METHOD: 


Factorial Chi-Square first became known to the writer in 1956 at Fs 
Montreal National Convention. A paper presented by Herbert C. Batson 1 
aroused sufficient interest to apply the technique to a current problem 
with encouraging results. Since then the approach has been used on 
several other problems, with considerable success. 


The basic Chi-Square formula generally appears as follows: 


x2 =D(E - 0)? (formula 1) 


Where E is the expected occurrence and O is the observed occurrence, 


Let us apply this to the following problem. The lost time acci- 
dents for a given year were distributed by shifts as follows: 


lst shift = 4 
2nd shift = 13 
3rd shift = 13 


Not knowing hours worked per shift or working condition differ- 
ences, we assume that the opportunity for accident occurrence should be 
equal for each shift. Translating this into the terms of our problem 
our expected occurrence for each shift becomes 10. Solving for 
Chi-Square with 2 degrees of freedom we get the following: 





2 
x2 = (10 - 4)? + (10 - 13)? + (10 - 13)" = 3.6 + .9 + 09 = 5.4 
10 10 10 


Our hypothesis is that there is no difference in lost time accident 
occurrence by shifts. Looking in a table of Chi-Square values for 
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2 degrees of freedom we find Chi-Square os = 5.99. We therefore are 
likely to conclude that there is insufficient evidence for the rejection 
of our hypothesis even though it may appear that the first shift has a 
tendency to have fewer accidents. 


Supposing our combined results for a two year period had been as 
follows: 


lst shift = 8 
2nd shift = 26 
3rd shift = 26 


Assuming the same conditions as in the foregoing example we would 
expect 20 lost time accidents per shift. 


The Chi-Square calculation then becomes: 


x2 = (20 = 8)? + (20 - 26)* + (20 - 26)? = 7,2 + 1.8 + 1.8 = 10.8 
20 20 20 





Our hypothesis that the accidents occur without bias by shifts is 
rejected at the .0l1 level of risk. (Chi-Square 01 2 degrees of freedom 
= 9.21.) 


From the two foregoing examples we see that Chi-Square is affected 
by sample size, just as are most statistical procedures. 


Table I gives a simple example of the use of Factorial Chi-Square. 
The illustrated problem deals with a fatigue bend test wherein a piece 
of metal is flexed a given number of times and if a crack or breakage 
occurs, the test piece is regarded as a failure. Two types of steel and 
two methods of processing are used as the variable factors. The data 
used is for specimen purposes only. 


Using the Brandt-Snedecor (2) formula (Formula 2) the total 
Chi-Square with 3 degress of freedom is found to be 14,27, 


x? = NAS xX F) x ss (Formula 2) 
(k-1) 
where k = number of subgroups 
S = number of successes 
F = number of failures 
N = total number of samples 


2 2 2 2 
SS = (6, /n1 +s2Q /ng + oe + 8k (me -S MW 


Little s and n are the successes and number of samples respectively 
in each subgroup. 


Formula 2 is much faster than formula 1 and yields the same results 


within rounding error. In this instance formula 1 gives Chi-Square = 
14,28 with no rounding error. 
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The lower portion of table I utilizes the Factorial Chi-Square ap- 
proach as outlined by Batson in the 1956 transactions. By this method 
the Chi-Square contribution of each effect is readily known and an inde- 
pendent assessment of the significance of each effect is made. 


Table II gives an example of the application of this method to a 
23 factorial designsan eight cell pattern using the three factors each 
at two levels. The computations are the same as shown in table I. The 
interaction which is obvious in table I is not as easily seen in table II 
until some of the computations are completed. In fact, there might have 
been a tendency to conclude that there was nothing significant in the 
data if only a total Chi-Square had been computed, as Chi-Square .o) 
7 degrees of freedom = 18.48. 


This then, is the basic Factorial Chi-Square technique. Next, let 
us adapt the method to some of the process problems of industry which 
invariably involve unequal cell sizes when arranged in factorial pattern. 


Table III gives an illustration of data of this type - unequal cell 
sizes. Again the data is for specimen purposes only but is indictative 
of typical steel problems. The successes represent the band or O.K. 
edges of approximately 1500 Hot Rolled Coils, The failures are edges 
that have cracked sufficiently to cause trouble in future processing. 
The three variables are chemical components of the steel broken into a 
high (+) and low (-).range. A typical analysis procedure might work as 
follows: 


Step One: 
Record all data on Punch Cards. 
Step Twos 


Find the midpoint of each variable and use this as the 
breaking point for determining high (+) and low(+) values. 


Step Three: 


Factorially sort the cards into the eight ceils or combi- 
nations required by a factorial pattern. On a problem with 
a relatively small number of cells such as in the example, 
conventional tab sorting equipment would suffice. However, 
if the number of cells were larger, 16, 32, 64 and on up, an 
I.B.M. 101 Statistical Machine would do a rapid job and give 
actual printed listings. 1500 cards could be broken into a 

factorial pattern with 32 cells in less than 5 minutes, 
provided_the board wiring was done ahead of time. However, 
once a board has been prepared additional 2° runs could be 
made with relatively little board wiring time. 


' Step Fours 
Compute the total Chi-Square using formula 2. 


Step Fives 


Compute the Factorial Chi-Square components using the method 
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as shown in table III. In table III the successes are reduced 
in the proportion of the cell size (n) to the smallest cell 
size, in this case 141, The ratios are rounded to the nearest 
whole number. The computations for Factorial Chi-Square are 
then made using the ratioed or prime values. N which equaled 
1498 becomes 1128, S which was 854 becomes 665 etc. Naturally 
our total Factorial Chi-Square is now less than that of our 
total Chi-Square because of this manipulation. 


Step Sixs 


It now becomes necessary to interpret and use our results. By 
using the Factorial Chi-Square technique we find many of our 
effects are significant. However, action should not be taken 
just because Chi-Square indicates significance. The Chi-Square 
test of significance acts merely as a stop and go sign. If the 
sign says stop (no significance) we will not proceed, but just 
because it indicates go (significance) doesn't mean we must 
take action. In our example the effect of copper was very 
strong. Our estimates showed that better than a 20% im- 
provement could be made if we worked only the low side of the 
copper range. Yet the cost of reducing copper in the steel 
might be prohibitive even if it could be done at all. The 
sulphur content possibly could be controlled, but would require 
some cost expenditure. We must then make a cost study to de- 
termine if reducing the sulphur would be economically feasible. 
Our best estimate is that an approximate 10% improvement would 
result if we worked to the low side of the sulphur range. Per- 
haps the silicon effect would be the cheapest to control and 
would afford the greatest improvement per dollar spent. The 
interaction BC, even though significant might not afford any 
increased benefits once the silicon is properly controlled. A 
proper examination of a 2-way table should help answer this 
question. Because corrective action is difficult or impossi- 
ble on some of the effects does not mean we should ignore these 
indications in future experimentation, rather, we should use 
the knowledge by properly designing future work so that these 
effects do not cloud the appraisal of other economically 
manageable effects. 


USE OF HIGH SPEED COMPUTERS: 


Attempts have been made to program portions of the Factorial 
Chi-Square technique on high speed computers. Probably the greatest 
benefit occurs in the selection of breakpoints for the determination of 
high and low groupings. The computers can select the breakpoints so that 
the smallest cell size is a maximum, Table IV shows a step by step pro- 
cedure for a proposed computer program utilizing Factorial Chi-Square. 
The steps of the program are as follows: 


Step Ones 


Prepare a frequency distribution and compute the average and 
standard deviation of each variable. 
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Step Twos 


Plot the percentage of successes for each increment of the 
frequency distribution for each variable. This enables a crude 
check for serious departures from linearity. 


Step Three: 


Determine the optimum break points for each variable so that 
the smallest sample size will be a maximum. In the actual 
running of a problem minimum cell size limits are given the 
computer and if this minimum is not met, predetermined varia- 
bles are automatically dropped until such time as the original 
minimum pocket size is met. 


Step Fours 
Calculate the total Chi-Square using formula 2. 
Step Fives 


Calculate the Factorial Chi-Square components using the 
smallest cell sample size as a basis for the computations 
(same method as table III.) 


Step Sixs 


Calculate the percentage effect of each cell for use in 
interpretation of results. The percentage calculations can be 
of help in presenting the data to management and can help 
answer the question of the percent improvement that can be 
expected under various processing condition combinations. 


SUMMARY: 


Some of the computational details of Factorial Chi-Square are pre- 
sented. In addition a technique for handling unequal sample sizes is 
presented along with a program outline for high-speed computers. While 
the outlined procedures will yield a Chi-Square value that can be tested 
for significance, the occurrence of a significant value is not always 
cause for corrective action. Tempering the results with good judgement 
will enable the experimenter to use this technique on many attribute 
type problems. The ease and speed of the calculations should be of 
benefit to any experimenter or analyser who is interested in quick ap- 
proximations with fair reliability on attribute-type data problems. It 
is again emphasized that the Chi-Square test of significance is used 
only as a guide, supplementing good judgement. 


In using unequal sample sizes the total Factorial Chi-Square sig- 
nificance level will always be less than the total Chi-Square value for 
the problem. If the total Factorial Chi-Square value is considerably 
less than the total Chi-Square value, it is an indication that the sample 
sizes are out of proportion to each other. If the sample sizes are ap- 
proximately equal and the minimum sample size is 20 or 25, the method 
should work quickly and satisfactorily. 
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WHO CONTROLS QUALITY - AND HOw? 


Warren R. Purcell 
Raytheon Manufacturing Company 


INTRODUCTION 


Picture a manufacturing organization with Inspection and Quality 
Control groups which perform their functions perfectly. The Chief In- 
spector is a genius at administration, as is the Quality Control Manager. 
All inspection, measuring, and test equipment is maintained in excellent 
condition. Inspectors never err, even on visual characteristics. The 
sampling plans in use maintain just the right balance between cost of in- 
spection and cost of wrong acceptance decisions. The Quality Control 
group supplies reliable quality data in useful form, and is well staffed 
with quality control engineers who have unusual ability to diagnose proc- 
ess ills quickly and accurately and make recommendations which, if fol- 
lowed, would eventually eliminate all quality troubles. 


Would such a manufacturing organization have quality troubles? 
Of course it would. 
Why? 


The answer is well known and has frequently been stated. 
It is this: quality is controlled by many persons, and most of them are 
in neither the Inspection nor Quality Control groups. They are in many 
other groups, including General Management, Accounting, Research and De- 
velopment, Design, Planning, Purchasing, Production, Production Control, 
Industrial Engineering, and Sales, 


How can these groups help in the control of quality? What, 
specifically, can they do? 


To help in answering these questions the following suggestions 
are made. Admittedly, the list is not complete. It is requested that 
other ideas which occur to readers be submitted in letters to the Editor 
of INDUSTRIAL QUALITY CONTROL. 


GENERAL MANAGEMENT 


1. Maintain a consistent quality policy. 


There wil? be occasional quality failures. Do not let an 
occasional failure, exaggerated out of proportion because of 
the particular persons affected, cause an excited drive to 
tighten up inspection. For this starts the oscillation of 
the pendulum between quality at the price of quantity and 
quantity at the price of quality. 


2. Keep informed on the total cost of quality. 


These costs fall into three broad categories: prevention, 
appraisal, and failure. This has been pointed out by 
Feigenbaum.(1) Count the total, the total counts. An in- 
crease in one category associated with a greater decrease in 
the others is good; a decrease in one category associated with 
a greater increase in the others is bad. 


3. Keep informed on "market quality." 
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If your quality is poorer than that of your competitors, 
you are in grave danger. If it is better by a wide margin, 
your quality costs may be too high. 





4, Maintain the balance between know-how and open minds. 


Create an atmosphere in which all the know-how in the or- 
ganization may be used, but do not let opinions stand in the 
way of finding facts. 


ACCOUNTING 


1. Maintain adequate records of internal quality costs. 


For general management, the breakdown into prevention, 
appraisal, and failure may be enough. For the quality control 
manager, however, these should be further broken down by cost 
centers, parts, defects, and other classifications. For the 
essence of good quality control management is the concentra- 
tion of adequate quality control skills at the points of con- 
centration of quality costs. 


2. Maintain adequate records of purchased parts quality costs. 


RES® ARCH 


The costs of source inspection, detail sorting of lots re- 
jected by sampling, rework by the vendee, and other costs 
generated by poor quality of purchased parts are not properly 
chargeable to Inspection or Quality Control; they are incurred 
by Purchasing. They should be charged to Purchasing, which 
should, at its discretion, charge them in turn to the vendor. 


AND DEVELOPMENT 





1. Make full use of modern techniques of experimenting. 


Decisions made by Research and Development have important 
effects on quality and its control. Choices are made among 
materials, methods of testing them, methods of processing 
them. Many of these decisions are reached by running exper- 
iments. The traditional methods of experimenting are not 
only costly and inefficient; they also yield wrong answers. 
The Research and Development group which does not use modern 
techniques of experimenting is simply putting its company 
out of competition. 


2. Beware of the sensitive process. 


DESIGN 


A process requiring close control may succeed in the lab- 
oratory, but be too sensitive for the factory with its back- 
ground of more severe variation. The search for a less sensi- 
tive process may pay handsomely in reducing the cost of 
quality. 


1. Base design changes on factual feedback of information. 
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Design should be an active part of closed feedback loops, 
basing its changes on production difficulties, inspection re- 
sults, and customer reactions. The information from these 
sources should be based on factual data which have been dis- 
tilled by a reliable quality control staff. 


2. Recognize tolerancing as a major technique of engineering. 


While there still remains some degrce of guess-work in 
establishing tolerances, much progress has been made in devel- 
oping a truly scientific approach to this important subject. 
The following principles are now generally recognized: 


(a) A tolerance is of no value unless there are 
means available to verify conformance to it. 


(bo) The tolerance capability of a process can be 
measured scientifically. 


(c) Limitation of designs by the extremes of pos- 
sible tolerance combinations is unrealistic 
pessimism. 


3. Analyze designs for potential quality troubles. 


The Production and Quality Control groups provide valuable 
feedback information on this problem. 


4. Allow an adequate "proving-in" period for new designs. 


Do not be misled by a test of too few pieces, or by the 
fallacies of traditional methods of experimenting. 


PLANNING 


1. Use manufacturing equipment capable of meeting engineering 
tolerances. 


Do not depend entirely on unsupported claims. Have the 
equipment accuracy capability analyzed by competent quality 
control engineers. 


2. Use inspection and test equipment compatible with the accuracies 
required. 


Do not depend entirely on unsupported claims. Have the 


equipment accuracy checked by competent quality control engi- 
neers. 


' 


3. Before buying new equipment for quality reasons, be sure the 
present equipment is doing the best it can. 


Do not depend entirely on unsupported statements. Have 


the accuracy capability of the present equipment analyzed by 
competent quality control engineers. 
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4, Consider the quality aspects of the locations of inspection and 


test areas. 
The following questions, among others, arise: 
(a) Can adequate lighting be provided? 
(b) Can adequate freedom from noise, vibration, 


dust, and other troublesome environments be 
provided? 


(c) Is receiving inspection near the receiving 
area? Are inspection stations near their 
corresponding work flows? 


PURCHASING 


1. Assume the primary responsibility for the conformance of pur- 
chased parts to engineering specifications. 


The costs incurred by lack of conformance can properly be 
charged to the purchasing function. 


2. Keep informed on the costs of non-conformance of purchased parts. 


Get information on these costs from accounting. Use the 
services of the quality control vendor rating system in se- 
lecting vendors. 


3. Initiate contracts with qualified vendors only. 


Qualification may be determined in one or more of three 
ways: 


(a) Reputation for quality in products similar 
to the one for which a contract is being 
negotiated. 


(b) Experience with the vendor on similar products. 


(c) A quality control survey of the vendor's 
manufacturing and inspection facilities. 


4, Allow an adequate "proving-in" period for a vendor on a part he 
has not supplied before. 


In spite of reputation, experience, or an advance survey, 
the vendor may still encounter difficulties and/or misunder- 
standings. Advance samples shipped to the vendee for inspec- 
tion and approval, prior to the start of a substantial run, 
are advisable. 


5 Secure deliveries early enough to permit adequate inspection and 
a free decision to accept or reject. 


If parts are so badly needed that they mst be accepted 
even if non-conforming, quality trouble is in store. 
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6. Accept, and secure acceptance from vendors, of the principle of 
lot rejections. 


Lots of acceptable quality should be the normal expecta- 
tion. The verification of acceptability can usually be done 
by sampling inspection, at normal quality cost. If the lot 
quality is not acceptable, it can also be rejected at normal 
quality cost. The sorting of defective pieces, however, is 
an.excessive quality cost, and shculd be borne by the vendor, 
not the vendee. 

7. Cooperate in programs of crosschecks of gauges and test 
equipment. 


Such crosschecks are preventatives of misunderstandings 
between vendor and vendee. ‘Sometimes the best method of coop- 
erating is by letting the vendor's and vendee's quality con- 
trol groups set up and operate the programs without inter- 
ference. 


PRODUCTION 
1. Provide adequate quality training for operators. 


This is a platitude. But its importance mst still be 
stressed. The difficulty is that immediate problems compete 
more severely for the supervisor's time than do longer-term 
problems such as training. 


2. Use in-process inspection as a service to production. 


In-process inspection prevents the expenditure of further 
cost on defective parts. It also provides immediate feedback 
information for process control. In both respects it is a 
service, not a deterrent, to production. 


3. Do not ask process inspectors to supervise operators. 


It is sometimes suggested that roving inspectors point out 
directly to operators that their techniques or methods are 
faulty. This is a serious violation of the prerogatives of 
the production supervisor. Inspectors should criticize the 
product or the process, not the person. 


4, Do not expect acceptance inspectors to "flinch". 


The function of the acceptance inspector is to check the 
conformance of the piece to the specification. If it con- 
forms, he accepts; if it does not conform, he rejects. That 
is the end of the acceptance inspector's function. Reconsid- 
eration of rejects should be appealed only to a higher author- 
ity. Breaking of this policy is the beginning of loss of con- 
trol. 


5- Use the simpler modern quality control techniques as production 
tools. 
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6. 
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There have been developed. several modern quality control 
techniques which require little statistical skill but are 
based on pure logic. Operators should use these as tools of 
their own; not as tools which can be handled only by special- 
ists. 





Attack the chronic quality problems. 


The sporadic problems compete harder for attention, but 
the chronic problems are usually the more costly. 


Call in the specialists as soon as necessary. 


Some of the chronic quality problems do not yield to the 
shop know-how, even supplemented by the simpler quality con- 
trol techniques. When that point has been reached, call for 
the services of the specialists versed in the practical use 
of advanced quality control techniques, 


PRODUCTION CONTROL 





1. 


Consider the quality advantages of long production runs. 


Scheduling quite properly attempts to find the balance 
between the high inventory costs of long production runs and 
the high manufacturing cost of short runs. Do not overlook 
the fact that quality costs, as well as production costs, are 
lower on longer runs. 


Consider the quality advantages of large lot sizes. 


Quality costs are lower on large lots. 


Consider the quality advantages of "first in first out." 


Quality changes occur during production, for both clear 
and obscure causes. Identification of the corresponding parts 
is easy under "first in first out", difficult under "last in 
first out". 


Allow time for adequate inspection and a free decision to accept 
or reject. 


If parts are so badly needed that they must be accepted 
even if non-conforming, quality trouble is in store. 


INDUSTRIAL ENGINEERING 





l. 





Include adequate quality clauses in incentive plans. 


Strange as it may seem, there are incentive plans in ex- 
istence which pay bonus for quantity without regard to qual- 
ity. Most modern plans, however, do not pay for defective 
pieces. Nevertheless many of these plans fail to penalize 
for the inspection cost of finding the defectives. Such costs 
should also be charged against the incentive; otherwise the 
company pays for inspection in order to pay the quantity bonus 
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to the operator. 
2. In studying inspection methods, consider inspection accuracy. 


The value of inspection should be measured not by the nun- 
ber of pieces going through the inspector's hands, but by the 
number of defective and non-defective pieces correctly iden- 
tified. 


3. In time-studying inspection, consider inspection accuracy. 


The value of inspection should be measured not by the 
number of pieces going through the inspector's hands, but by 
the number of defective and non-defective pieces correctly 
identified. Furthermore, the speed of inspection is substan- 
tially affected by the number of borderline pieces on which 
decisions must be made. 


1. Do not exaggerate quality claims to the customer. 
The disappointed customer is the complaining customer. 
2. Do not exaggerate quality complaints to the factory. 


A comment such as: "Our quality was never as poor as it 
is now" may stir up people. But without specific, unexagger- 
ated facts, the resulting poorly-directed activity is likely 
to do more harm than good. 


3. Encourage direct contact between factory and field. 


It has been shown that direct customer contact by a per- 
sonable representative of quality control, with the contact 
controlled by Sales, not only provides feedback of factual 
quality information of value to the factory, but also improves 
customer relations. 


CONCLUSION 


The list could be further extended to include personnel, legal, 
marketing, advertising, industrial relations, receiving, shipping, main- 
tenance, and other groups. Thus it appears that each person in the or- 
ganization can make specific contributions to the success of quality 
control. 


Everybody controls quality - and how! 


s (1) A. V. Feigenbaum: "The Challenge of Total Quality Control." 
INDUSTRIAL QUALITY CONTROL, Vol. XIII, No. 11, May, 1957, ps. 17-23. 











MEASURING VISUAL CHARACTERISTICS 
C. A. REED 


GENERAL ELECTRIC COMPANY 


It is well known that “unless you can measure and express it in 
numbers ----- you don't know much about what you're doing”. 


Accurate measurement of thickness, weight, torque, noise, 
electrical energy, etc., is quite common. There are continuous ad- 
vancements being made in inspection gaging techniques, many of which 
are automatic and not only eliminate the inspector but automatically 
adjust the machine or process. In many instances the prime objective 
is to eliminate the human element. People just have too much varia- 
tion and are incapable of being consistent. 


The usual visual inspection depends almost wholly on the opinions 
of people, and still worse, there are usually, at best, only vague 
specifications, such as "must be suitable for plating” or "must be 
free of objectionable flaws”. Therefore, most of us have been depend- 
ing on the judgment of people who cannot make consistent or intelligent 
decisions because they have not been equipped to measure. It is not 
fair to the inspector, the company, the vendor, or the customer to 
allow the use of immeasureable opinions. To completely eliminate the 
human element in visual inspection would be a rather tall order, but 
there are ways of helping people do a better job. Of course, there is 
always the possibility of using samples or specimens, but in most cases 
it requires too many and there is the problem of upkeep due to deter- 
ioration and loss. 


To overcome this sort of situation we have had considerable 
success by using a system devised to numerically evaluate visual de- 
fects, whether for appearance or for functional characteristics. This 
system reduces inconsistent inspector's judgment and is simple enough 
so that people like to use it. It makes for better quality level 
understanding and communication between vendor, manufacturers and 
customer. It is readily adjusted to any desired quality level and is 
economical to use as it is designed around the conventional sampling 
plans but includes grading by degree of defectiveness. 


This system is operated in the following manner: 


ACCEPTABLE QUALITY SCORE Xn - is established by determining what the 
customer expects for a minimum score or the percent and degree of 
defectiveness you are willing to accept. 


LOT SCORE X - is established on each lot by sampling 20 parts, scoring 
them by the procedure below on a Scoring Work Sheet (Fig. 1) and taking 
the average of the 20 individual part scores. 





THE SCORING WORK SHEET - is intended to establish a uniform scale of 
scoring. Specimens of typical defects in the different score levels 
will be useful and should be available for the inspector's use when 
practical. It is important that the score given each piece represents 
the closest description of the worst defect on the Scoring Work Sheet. 





INSPECTION INSTRUCTIONS 





1. Select sample (20) parts at random. 


2. Inspect each part under optimum lighting conditions as 
follows: 


First hold part at arm's length. 

a. Inspect for detail - such as printing on a dial. 
b. Inspect for background defects. 

Second bring part in close. 

a. Inspect for detail. 

b. Inspect for background defects. 


NOTE: If any defect is found in the first step it will not be 
necessary to take the second step. 


COMPUTE LOT SCORE X - as in the following example. 








Score Number of Dials Score Times Number 
$ 2 10 
4 8 32 
3 7 21 
2 2 4 
1 l l 
-10 0 0 
Total 20 68 
xX = Score x Number = 68 = 3.4 
Number 20 


Lots having a score (x) greater than X, are acceptable. 
Lots having a score (X) less than X, are rejectable. 


The exception to the above is when -10 parts are found. If one 
(-10) part is found in the sample of 20 and the score is acceptable 
after the computation above, draw an additional sample of 30 parts 
and inspect for -10's only. If none are found in the 30 additional, 
accept the lot. If any more are found, reject the lot. 


If 2 or more -10's are found in the sample of 20, reject the lot 
immediately and note the dial score as (0) zero. 
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The example work sheet (Fig. 1) shows a previously established 
minimum acceptable score of 2.5 for this particular part. The lot on 
which the work sheet was made, scored 3.4 or .9 points better than 
minimum acceptable. Therefore, the decision to accept this lot was 
not made by the inspector's judgment or opinion, but rather by the 
established value of 2.5. The judgment on the part of the inspector 
was reduced to only determining whether what she observed on one part 
at a time fitted the description of, for example, class 3 or class 4 
degree of defectiveness. Thus the inspector did not have to decide 
whether the part was good or bad. 


When a situation exists that requires a change in acceptance 
level, either up or down, a simple matter of adjusting the acceptance 
number does the job completely and to the exact amount required. It 
is not necessary to require the inspector to "tighten up a little” or 
"be a little more lenient”. 


Establishing the most desirable acceptance score number is, of 
course, an important part of the system. The same general approach 
should be used as that used in establishing acceptance in any con- 
ventional sampling plan. In this case it not only depends on the 
percentage of defects you are willing to accept, but the degree of 
defectiveness of those defects. Often it is just a matter of the 
proper balance between what the capability of a process is and what 
is required in the final product. 


It is also most helpful to summarize these work sheets (Fig. 1) 
weekly or monthly and plot graphically the percent of the lots re- 
jected, the average score of the accepted lots and the average score 
of the rejected lots. This points out not only how bad the rejected 
lots were but how good the accepted lots were, and indicates trends 
or changes in the process. 


There are many ways in which this scoring by number can be 
applied, the details cf which should be worked out for each particular 
part or situation. 


When setting up one of these plans, keep in mind the general idea 
of breaking down and describing the visual characteristics and assign- 
ing numerical values in a form that reduces inspector's decision 
making and that results in the kind of protection you want. 
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VISUAL DEFECT SCORING WORK SHEET 





















































Chrome Plated Die-Cast Bezels 3 
oo a ® ows 
2 © bh & © 2 
Ees|ec|cogeE 
so — © own Ss 
ZalHnN|AHS 
None under close visual inspection. 
Wys | /0 
Not visible at arm's length -- visible only HY 
after close inspection. } 4 
M1 BL 
Not visible at arm's length at first glance -- 
apparent at close inspection -- later visible at HY 3 / 
arm's length by careful scrutiny -- pits, L 
scratches, light stains, minor flashing marks. iL 
Same as above except heavier or greater degree. I/ 
at te 
On first look visible at arm's length with | 
little effort -- blisters, peeling, heavy tool 1 
marks, dark stains, dents or heavy nicks. 
Any defect clearly visible at a distance of 
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Part No. Total Number _ 20 
Vendor Total Score X number = 
> 
Quantity 
Score X Number _ | 0+ score X 
Date Number ™ 
Insp. - 
Min, A t ore X 
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"QUALITY CONTROL IN LOCOMOTIVE MANUFACTURE" 


F. H. Howard 
Chief Inspector 
General Motors Diesel Limited 


This presentation will be confined to that part of the plant devoted 
to the fabrication and assembly of the Diesel locomotive and its struc- 
ture, and will not involve the manufacture of engines or electrical trac- 
tion and switching equipment. The rate of production during the period 
of development of this system varied, generally upwards, from three to 
seven complete locomotive units per week. 


The Diesel-electric locomotive is a self-propelled piece of electric 
motive-power, carrying its own Diesel-powered DC generating station. 
Variations in direction, horsepower, speed and pulling effort are 
achieved in the electrical sense by adjusting excitation of rotating 
equipment, and by the automatic alteration of polarities and motor hook- 
ups. These are accomplished principally by low-voltage relays, contactars, 
interlocks and switches, and furthermore there is a host of other cir- 
cuits - lighting, starting, charging, and protective among others, some 
of them AC - which are cut in and out from time to time, many auto- 
matically. The locomotive therefore has a relatively complex nervous 
system of high and low voltage DC, and 3=-phase low voltage AC. Its 
stressed foundation is a plate and structural steel fabrication capable 
of withstanding 400-ton pull and buff loads, This foundation carries 
most of the propulsion and auxiliary equipment, as well as a hood and cab 
to shelter machinery and crew; on certain types of locomotive, the hood 
and frame share the stresses as a deep box-girder. The swivel trucks car- 
ry, besides motors, both brake and spring rigging, with pulling and retar 
ding forces transmitted through their centre bearings to the underframe, 


A Diesel-electric locomotive sells for a little over $100 per HP, or 
close to $200,000 per freight unit, when it includes some of the additia~ 
al equipment added to our basic specification by the railway. It goes to 
a customer who carries it on his books as acapital asset; a locomotive 
fleet is of great intrinsic value to him, he may have secured the neces- 
sary funds with some difficulty, and he expects to earn his living by 
pulling trains with it; that is about the only way he can earn his 
living, and what's more, since he has bought it to replace a fleet of 
steam locomotives, many of which were only partially depreciated, he is 
obliged to look for a handsome return on his heavy investment. He will 
consequently use the equipment severely and intensely, frequently over- 
loading it, frequently under-maintaining it, and often with relatively 
untrained people. If this capital equipment becomes unavailable due to 
our inadequate design or workmanship, or even due to his own inadequacies, 
he is temporarily prevented from earning his living and develops an anti- 
pathy to the failed equipment. All of which is said to emphasize that our 
company is selling a very expensive article, whose reliability must be 
the over-riding consideration; in other words, the market has been asses- 
ed and found to have certain rigid requirements, with little aversion to 
a high price tag. The cost of achieving acceptable quality cannot be 
permitted to interfere when the market will tolerate only perfect perfor- 
mance and will pay for it. I will concede, however, that close attent- 
ion to manufacturing detail discloses areas where an adequate quality 
level can be achieved by somewhat relaxed standards of workmanship, areas 
where the customer js certainly not interested in paying any premiumfor a 
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quality that he cannot exploit, although our own integrity is frequently 
demonstrated by maintaining high standards even in those areas, 


Our locomotives are manufactured on an assembly line, each work sta- 
tion contributing fairly substantial man<-hour increments as a unit passes 
through. Depending upon the number of positions or fixtures in each sta- 
tion, a locomotive may spend from one to five days in one place before 
moving on to the next. Major components are built on subsidiary lines 
which feed into the main line at appropriate points, At this time I must 
describe the system whereby special requests or customer "extras" are in- 
troduced into a locomotve, An order for a group of units may include up 
to 50 extras, accessories that the buying railroad feels are essential to 
its operation, and for which it pays an additional price, Some of these 
extras are as complex as the locomotive itself, and are themselves 
broken down as a matter of efficiency, with portions applied at the 
various work stations; for example, the piping portions of a dozen extras 
may be applied in the piping station, along with the basic piping. In 
short, each station on the assembly line has to know the total net con- 
tribution it is to make to a particular order of locomotives, and it may 
handle units to three or four different specifications on as many 
successive days, I say net because in many cases extras involve not only 
additions but substitutions, and therefore deductions of basic equip- 
ment. 


This group of assembly stations constitutes what are called "con- 
sumer departments" because they consume the end-product of the remaining 
departments, which are known as “producer departments", These latter 
produce the piece-parts and minor sub<assemblies, in quantity enough for 
several weeks' consumption, and by part-number, according to a schedule, 
Their product is not necessarily related to a specific locomotive or 
order, but does embrace a specific run of various locomotives going 
through consumer departments, 


You will see that quality problems might differ markedly between 
these two groups, Our original plan of controlling quality was to assign 
an inspector to each Production Department, on each shift, under an 
Inspection foreman on each shift. In the producer departments, these 
inspectors were to sort out good from bad, provide casual in-process 
inspection when sorting permitted, write scrap or rework tags on the bad, 
and accept the good; it was our assignment too to count the good, and 
fill out forms notifying Accounting and Production Control what was to 
be added to inventory. A carbon copy of this acceptance form was used 
as & "move" order, (Material Handling would transport nothing without 
it), and so the inspector had to consult routing sheets to find out and 
mark down where the various lots of parts were destined to go. Locomo- 
tives being compositions of many thousand components, built in orders of 
from one to 75, and in three or four styles in one mix, these depart- 
ments produce an extremely high variety of parts, in relatively small 
lots; so a good proportion of the inspectors! time was spent in the cler 
ical activity, writing perhaps 100 acceptances per shift, and what time 
was devoted to quality tended to concentrate in the despatch area where 
nothing much could be prevented except somebody else getting sub-stand- 
ard material. Prior to the beginning of this narrative, our plant had 
installed IBM equipment, allowing us to adopt a system of pre-printed 
acceptance cards, so that all the inspector had left to hand-write was 
the quantity accepted. This had made more time available for inspecting, 
and much of it was useful, if rather undirected, "floating" inspection. 
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Our first move in improving Quality Control in these producer depart- 
ments was to have the inspector approve the "first-off" a machine or 
work=-bench, or inspect the lot according to a sampling plan extracted 
from US MIL-STD 105A, or both, Our AQL's were set at various figures 
judged to be consistent with the contributions of these departments to a 
quality locomotive: they ranged from 1% on critical sheet-metal to 10% 
on copper tubing, and were determined by comparing the cost of reworking 
items undetected by inspection against the cost of 100% inspection. 

Where pieces moved over several machines, as in a sheet-metal fabricating 
shop, we arranged to make the sampling check at each machine; finally the 
deck of pre-printed acceptance cards was put on wheels and accompanied 
the inspector on his rounds, so when he had approved 2 group of parts off 
its last machine he could affix the "move" tag, and go about his business, 
divorced from the old inspection area, where he had formerly been pinned 
down for too much of his day. Circulation among the various work sta- 
tions with pieces being inspected as operations were performed, combined 
with the abandonment of any sorting philosophy, could only result in a 
lowered cost of achieving quality, if not a better outgoing quality. 
Production was committed, under the sampling plan, to sort. out good from 
bad at the machine or work station where the sampling discovered un- 
acceptability, and re-submit their bona-fide best efforts. We devised a 
homemade kind of "p" chart which indicated quality trends and kept us all 
assured that things were under control. Figure 1 shows the form on which 
was accumulated the data for posting on the charts; it can be seen that 
we are simply keeping track of the percentage of defects found in the to- 
tal of pieces constituting all the samples selected in a shift; the ma- 
chine causing the defect is shown, where applicable, Figure 2 shows the 
chart for the sheet-metal department, with a line for each shift; the 
attainment of 4% defective we pronounced satisfactory for a so-called 
"job shop" operation. Now our Accounting procedure makes provision for 
scrap to be charged to the department creating it, and rework also to be 
chargeable to the department causing it, including charges coming back 
from the department finding the rework and having to do it, wherever it 
may be; our scrap and rework charges in these producer departments have 
traditionally been modest, and never cause for serious alarm, so we have 
never been obsessed by detailed analysis of those accounts, We thought 
it possibly more than a coincidence however when,we did compare charges 
in three departments before and after our change from sorting after the 
fact to formal sampling during the process, that they were found in two 
cases to be lowered by a third and a half respectively, There was 
decidedly less complaining from consumer departments on misfits and the 
like; Production was content that better control had been applied. 


After several months of satisfactory experience with this new mode, 
and having duly contemplated the results of a work=<sampling study made 
on these inspectors, we reached for further improvement, and re-organized 
the activity even more. We have assigned men specifically to all count- 
ing and identification, and relieved the departmental inspectors from 
any responsibility except that of quality; these men do nothing but 
count parts and handle the IBM paperwork in six produwer departments. 

In some cases this has meant the rebirth of the old "inspection" area in 
the form of a "despatch" area, since counters cannot distinguish what is 
acceptable and therefore ready to move, until it has been laid out in 
front of them, Certain weaknesses may be detected in this approach, 

but it certainly gets the business of counting away from the business 

of Quality Controlling, and incidentally lets us see what it costs, 
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The resident inspector has been removed from the pipe shop and sample 
inspection of individual pipes ‘and pipe assemblies by part number has 
ceased. Control has been applied to the various processes for attri- 
butes which we feel are essential to the reliability of a unit, whose 
secondary nervous system can be said to consist of piping filled with 
air at various pressures, hot and cold water, fuel, lubricant, steam and 
sand, This is handled by a piping inspector whose jurisdiction is shop- 
wide, and of whom more later; instead of sampling lots of pipes and tubes 
to find out, for example, which are too long - it will soon enough be 
discovered if they are too long, or too short - charts have been placed 
on the flaring, de-burring, threading, soldering, brazing, bending, and 
washing operations, where proper workmanship could make or break the 
quality oF our piping. These charts are "p" charts with "C" chart in- 
formation on them; the latter was added because it is essential to col- 
lect data as it occurs, since it carinot be duplicated in our kind of 
shop. The inspector mkes his examinations of processes rather than 
parts, on a random basis, without regard to the particular piece being 
processed at the time. Being shop-wide, he feeds back to himself and 
the foreman all the grief encountered on the line, so that corrective 
steps can be taken immediately. 


Resident inspection, sampling all work at all work places, has been 
removed from both the welded sub-assembly and the sheet-metal depart- 
ments, From the work sheets associated with our early chart, we had 
learned which machines in the latter area contributed the most to the de- 
fect picture, and control charts have been applied to the ten most signi- 
ficant of them. One inspector now patrols both these departments - the 
one using up about 40% of the production of the other, and consequently 
the two side by side - appraising the various operations and operators, 
and acquainting Production supervision with the contemporary state of 
affairs. This is carried on with no less confidence than the full-scale 
sampling, since so little of the product affects the reliability of the 
locomotive, and what does is well-known, Many of our stesl pieces go 
into fixtures for assembly, where discrepancies in dimension are quickly 
apparent, and if holes don't line up with other holes, the defect is 
self-checking. At some small expense of catching defects later in the 
cycle, we have gained much by reducing costly detailed inspection of 
low-value parts, that have consistently been in satisfactory control any- 
way. We hope that we can combine still a third department, the machine 
shop, into this inspector's area of responsibility, since its kind of 
operation - cleaning-up rather than close dimensional work 
- appears to lend itself to the system. 


Classification of characteristics was seized upon in the sheet-—metal 
fabricating shop to lower the cost of achieving quality. This department 
has been mentioned several times already, and it is in many ways a key 
one in locomotive manufacture, just as I imagine it would be in aircraft 
or ship-building. But it seemed to us that it was getting an excessive 
proportion of our Quality Control efforts relative to the value and inm- 
portance of its product, so it was not a key department in our operation. 
Too much time was spent, for one thing, in reviewing and justifying in- 
spectors' decisions, such as whether it mattered if a 2" flange was ac- 
tually 2 1/16"; a lot of this discussion was by people unenlightened as 
to actual fact. The drawings of course were quite unequivocal, and cal- 
led for a routine # 1/32" on all fractional dimensions (which can hard- 
ly be held on general purpose machinery), but like most shops of this na- 
ture, work was set-up by and checked against templates, By putting on a 
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"crash" survey program in the consuming departments over a 2<day period, 
we determined the relative importance of each characteristic on the bulk 
of basic pieces produced in this shop. The respective templates were 
then colour-coded to show both Production operators and Quality Control 
appraisors which characteristics were of critical and which of major im- 
portance; they had not known previously. To be precise, we developed a 
template convention consisting of a series of symbols denoting parallel- 
ness, edge distance, squareness, and so on, then painted these signs on 
the templates with orange for critical ( #1/32" . nothing at all for 
major ( #1/16" ) and green for minor (#1/4" ). Figure 3 shows some 
of these conventional signs. The minor decisions have not formally been 
exposed to Production workers, although they undoubtedly got to know what 
the green code meant. This obviates discussion as to whether inspectors! 
decisions are to be upheld or reversed, Many of us feel that coding of 
dimensions as to their criticalness, or alternatively setting an appro- 
priate tolerance on each dimension, should originate in the Mngineering 
Department as the design is conceived at the creative source; but it is 
rather late in the day to start coding drawings in our Company. We 
think it would be less expensive too, as having it done by a later autho- 
rity is only refined second-guessing; our Engineers are not unsympathetic 
to this scheme, At any rate, we have reduced the cost of deciding 
whether a part is good or bad, as well as the cost of reworking something 
which was good enough all the time, 





‘Tee following are \ypical cases of coding templates for critics] ant aon | - ——— | 3. Mele spacing. 
critical charecteristics. Dimensions thet should be held tos tolerance of L — 
ples or aims 1/32 should be coded red as shown is the emaples; these 
parnivted te wary up to L/h should be coded grenn. Th is understend that 
characteristics mot coded at all will be held to © tolerance of plus or t ] 
mimes 1/16 tne. | 4. ole location free stges. 
) ) 
| 
- — - 
_— (a) Overall Dimension. 

a 
= rt 

\ $. Comentrtéiny 


| 

| (e) Retine. 

6. Contour (Retire shape). 

| r+] 2. tele Disaster. \ 7. Contour (Portion only). 
\ 


So much for the producer groups; let us examine the consumer dejart- 
ments, the major assembly areas, The line consists of work stations 
where the locomotive underframe is first manufactured in sections; then 
incorporated into a whole = both principally heavy welding operations; 
deck-plated; equipped underneath with wiring, fuel tanks, and especially 
piping; then equipped on top with engine, generator, switchgear, and so 
on, followed by application of the cab and hood; then set on trucks, put 
through a stage of final equipping, and despatched to Paint and Test. 
Major assemblies like cabs and hoods, all equipped internally with their 
own piping, wiring, and operating devices, are built in their own 
ancillary areas, as are the trucks, Bear in mind that each of these 
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assemblies and partially finished locomotives has its own identity, and 
may differ in much detail from its predecessors and successors, Further- 
more, it had become expedient, and still is, for Production to ignore the 
Bill of Material shown on the drawings, and to rely upon a composite bill 
supplied by Production Engineering. 


In every consumer department, the work it has to do has been laid 
down, operation by operation, on a check-off sheet; moreover in many 
cases, each operation has been described separately on its own miniature 
check sheet, the whole collection of such bits of paper being bound to- 
gether in a book called immediately and inevitably a "check book", It can 
be appreciated that here is a vehicle for the introduction of "extras" 
exactly where they belong, either by the insertion of an additional check, 
or substituting one for a basic check, Here also is a vehicle for the 
inspector to make sure everything has been done that his department ought 
to do, because each check has to be signed off by Production before 
Inspection will look at that particular increment of work. Furthermore, 
the entire book of checks has to be counter-signed by Inspection before 
the locomotive can move on out of the departmenta Figure 4 shows a 
typical check; this one calls for the inclusion of a customer "extra", 
and the drawings describing the entire application are catalogued. All 
electrical checks have been assigned to a shop-wide Electrical inspector, 
who demands very high standards on the "primary nervous system", Each 
inspector, if he disapproves of the work in any way, is to list his 
points of difference on the back of the check and return it to the 
Production foreman for a second attempt at doing it properly. Figure 5 
shows a typical rejected operation. 
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At the time this was introduced, it was appreciated that right here 
was the source material for some rough statistical analysis. All checks 
were directed to the Inspection headquarters, and defects listed on the 
backs were there plotted, locomotive by locomotive, and department by de- 
partment on "C" charts; a simple set of descriptions was appended to 
break down defects, From time to time a ietter would go out to Product- 
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ion supervision, apprising them of the situations in various areas, 
pointing out what kinds of defects had been perpetrated, and from Perado 
curves what operation numbers had been trouble spots and what employees 
had been principal violators, Later on, control limits were calculated 
and instances of break-through reported out, 


There were at least two faults with this statistical scheme, One was 
that the charts were kept out of sight in the Inspection office area, and 
the other was that the analysis covered periods too far in the past. No 
noticeable improvement took place as months went by. However, one im- 
mediate and quite striking improvement had been achieved. Now that each 
operation was spelied out specifically for both Production and Inspection, 
the number of minor but irritating defects caught in the final super-in- 
spection had dropped by over three-quarters, That was quite tangible, 
and justified the amount of work put into preparation and issuance of 
the check books, each expended on one locomotive only, in one department. 


All checks were filed away in locomotive order and kept for a few 
months, so that any defect reported from the field could be traced back 
to the check covering the work. Since two people had signed the check 
as having seen the work done properly, they could be approached on it. 
There turned out to be very little of this necessary however, as our 
second immediate and striking improvement was that field-reported or 
customer-reported defects almost disappeared, 
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Gradually then, our control charts were moved out to the shop, for 
display in the various departments, and daily posting. This of course 
had the effect of bringing them up-to-date and making them of more than 
historical interest; no more had analysis letters to be written, because 
the picture was there for all to see, although high-incidence operation 
numbers and employee identities remained to be exposed. However, that 
turned out a minor deficiency: just as the "book" said would happen, 
the day these charts appeared, a trend began - downwards, Men, and 
foremen, crowded around; rivalry was created; a defensive attitude - not 
usually desirable, but this time a sign of challenged pride - grew up, 
Figure 6 shows where one foreman plunged in and started his ow statisti- 
cal analysis, a sort of moving average, and his coloured pencil worked 
overtime highlighting. It is a commentary on this kind of manufacture 
when I explain that the sudden upward jump near the end occurred when a 
major change had been introduced, and the foreman was temporarily dis- 
tracted from Quality by other problems attendant to it, 





Sloping control limits have been applied on an idea borrowed from an 
aircraft plant. With only a handful of defects in a complex of 200 man- 
hours, who wants to know the operation numbers or the men accountable? 
With a final turn of the screw perfection is in sight! Our Inspectors, 
I daresay quite naturally, tended to become a little more critical ina 
resolve to prevent utter perfection. Various conflicting ideas of 
criticalness are a source of trouble, incidentally, in the inspection of 
a locomotive, 


Elsewhere, improvement came in a different way. Figure 7 shows a 
chart of defects in a department where it seemed difficult to get matters 
under control, At the suggestion of Production management, who felt that 
this one chart failed to direct attention to trouble-spots, separate 
charts were displayed at each of the three or four progressive work sta- 
tions within the department, so the principal offender could be devoted 
extra supervision, The small charts' figures were totalled and the 
result can be seen as satisfactory; the master chart is now broken down 
by work stations, instead of by attributes, 


All these charts were, and are, kept up by the shop inspectors, who 
early became admirers of the technique, and required only instruction to 
put us in the fortunate position of not requiring anybody to go around 
and post charts, The final inspector, the eagle-eye who audits all the 
other inspectors, was given a pad of forms on which he registered the 
defects he found and the source whence they came; each preceding depart- 
ment had its control chart modified by what this "mopper-up" discovered 
and turned in. We had forecast that, based on prior experience with 
operation and worker numbers, these departments would require an 
analysis of their defects in more detail than the charts would show; so 
their own inspectors had these pads too - a sheet is shown on figure & - 
and the kind of defect was coded numerically, This form was used as a 
work sheet for the chart. Having dispensed with a "chart poster" and 
analyst, as was just mentioned, there was nobody to sort out the various 
classifications of defect, and it was accordingly arranged with our IBM 
group that information on these sheets would be punched onto cards, and 
tabulated by shift, employee, operation number and nature of defect, 
whenever we felt we wanted it. But this has never come to pass; with the 
exception of one department, whose chart you just saw come into control, 
the situation has never warranted this analysis, and our forecast has, 
so far, been wrong. We would just as soon it stayed wrong. 
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Our final refinement has now begun, after several months of -acclim- 
tization to this ress nably professional system. Having been assured 
that quality can be -wintained at a high level, we have looked for some 
profit by reducing the expense of so maintaining it. Our checkbook and 
check sheet system seemed to lend itself to a very careful coding of 
every operation according to its importance, In six consumer depart- 
ments, therefore, each operation has been consigned to one of three 
groups, The most critical ones have been flagged as mandatory check 
points, where inspectors must witness if possible, but certainly qualify 
upon completion, the worked called up. An example of an operation re- 
quiring mandatory inspection is the alignment of the Diesel engine and 
the traction generator; 100% inspection is laid down without relaxation, 
Such checks are surcharged "M", Certain operations where poor workman- 
ship can affect the performance of the locomotive, or can cause the 
customer trouble or unjustified maintenance, or even lower our reputa- 
tion in his eyes, are surcharged "Q" and require 100% inspection, but not 
necessarily at the time the work is done. The remainder are surcharged 
"S" and cover work that received casual or sampling inspection at the 
same time that "M" or "Q" inspection is carried out; these jobs in- 
variably are of the kind that have to be done properly before the next 
job can be started, or can be done only one way. Electrical checks are, 
as before, to be looked after by the Electrical inspector on a "Q" basis, 
We have now an Inspection Specification, as it were; little black books 
and private information are unlamented in their departure. On this 
philosophy, which contains no measurable element of risk in our opinion, 
we have been able to reduce the manpower in the consumer departments, 
retaining the final inspector because the value of our outgoing product 
still demands him; however, he performs his share of the coded checking. 
The inspector at the piping position is the general piping inspector, 
referred to on page 5,and carries out the air test on the piped under- 
frame, a four-hour soap-sud immersion that cannot be dispensed with ex- 
cept at the high cost of finding and repairing air leaks at the 
Operational Test station. 





One of the unwritten assignments of most inspectors, generally assum 
ed by themselves and not seriously challenged by supervision, tradition- 
ally has been to carry out simple investigations when things don't go to- 
gether properly in their areas; such investigation used to lead back to 
drawing mistakes, use of wrong blueprint revisions, template errors, pre- 
vious inspection errors, and so on, but seldom took inspectors off the 
production floor, and prolonged inquiry could not be proceeded with be- 
cause their presence was necessary for full-scale process inspection. 
When necessary, Production Engineering used to be brought into the af- 
fair. With all-day resident inspection no longer in effect, a pair of 
"Problem" men has been assigned to take charge of the entire range of 
this research, and Production Engineering's participation has been great- 
ly reduced. Due to the wide scopa of this overall Quality Control, we 
are contributing towards a real net reduction in Quality Cost. These 
Quality Control men maintain all departmental charts in addition to being 
on call by Production or Inspection when fabrication problems arise, 


Coincident with our best shop efforts at Quality Control is our feed- 
back of information from the customers' properties, There is really no 
time limit on the receipt of this intelligence, and 8-year old (million- 
mile) units developing ailments traceable back to our design or workman- 
ship occasionally receive mention in our Quality Reports. However, the 
first few months of employment usually show up any defects attributable 
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to the shop operations as covered in this presentation. These are re- 
ported by our Service Engineers engaged in continuous customer contact 
and inspection of units in service, and are presented to Quality Control 
for their review. If these deficiencies are in connection with something 
we are competent to comment on, or to do something about, we enter our 
remarks on the Quality Report; if one of them has arisen on something 
made in our shop, interested departments are designated, and where appli- 
cable the number of the check book or check sheet operation, The reports 
are then duplicated and distributed to Purchasing and Receiving Inspect- 
ion for vendor attention, and to Production and Inspection for shop at- 
tention; anything for Engineering scrutiny, or susceptible to design 
change is relayed to that authority, The departmental Control chart in- 
volved gets disfigured by a big red spot at the point where the offending 
defect has been allowed to escape into the field; as described previous- 
ly, the check is still on file, and the subject can be discussed with the 
foreman, inspector, and workman, Figure 9 shows a typical Quality Report 
with the department and check number shown, as directed tc the shop floor 
The advent of the check book seemed to coincide with the virtual dis- 
appearance of Quality Reports on shop workmanship, and only one or two 
are coming in each month, 


CONFIDENTIAL DATE July 26, 1957 
QUALITY REPORT SUMMARY No. #808 
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Figure 9 


This is the situation as it exists in General Motors Diesel Limited. 
We feel that high-priced piece of machinery, assembled to a great ex- 
tent by hand and with a minimum of automatic mchines or processes, re=- 
quires an Inspection operation that probably can never fall below a cer- 
tain minimum, We, however, are also convinced that the best Quality 
Control that we know how or can find out how to apply, will unquestion- 
ably result in Production attention to workmanship standards approaching 
quite closely what detailed Inspection has enforced in the past. It's 
just as easy to make it right the first time, it's obviously the cheap- 
est way, and training for prevention rather than sorting is, in our kind 
of business, the professional route to Quality. 


201 











AN APPROACH TO QUALITY CONTROL IN AN AUTOMOTIVE PLANT 


F. A. Stewart 
Chrysler Corporation 


This paper deals with what I believe to be the fastest, ficklest, 
frustratingest but most fascinating business in the world ---- the auto~ 
mobile business, I know of no other business where you have a complete- 
ly new or changed product each year and usually a completely changed 
plant or facility in which to build them, That is not even to mention 
the usual running changes and series of crash programs during a model 
year, which in themselves are enough to keep things off balance. When 
you combine these conditions with Quality Control, which is a relatively 
new and rapidly changing field, it can make one wonder about his point 
of reference from which to work, Having been a Product Engineer for a 
number of years and a relative new-comer to the field of Quality Control, 
I can perhaps appreciate these thoughts a little more than many people 
who are more experienced in this field. 


At the Chrysler Division our approach to Quality Control is as prac- 
tical and down to earth as we can possibly make it. For this reason, 
rather than attempt to present anything technically new or profound, I 
have chosen to give you some of our thoughts and experiences in attempt- 
ing to establish a better organized approach to Quality Control in an 
automotive plant. 


We have been at this business for five or six years at the Chrysler 
Division, to a more or less degree. We have had many notable triumphs 
and about as many diasal failures, Looking back over the past and con- 
templating the future, we think we can catelogue many elements or ingred- 
ients in a program or phase of a program which will spell success or 
failure, 


Scope of Operation 
At Chrysler the types of operations to be found range from a very 


highly automated and elaborately tooled machine shop with relatively few 
direct labor people (Figure 1), to an elaborately tooled sheet metal and 





Of course there are a wide variety of operations with all shades of gray 
of the dements mentioned above, 


The materials processed range from castings and forgings in the ma- 
chining operations through stampings in the sheet metal and welding oper- 
ations to such things as cloth and soft trim and paint. In addition to 
the above, tremendous quantities of glass, rubber, adhesives, bright 
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metal parts and vendor supplied sub-assemblies are processed in the as- 
sembly operations, 


From this brief description of the operation it probably is quite 
apparent that the approach to quality control would of necessity take 
many forms, with techniques and applications tailored to various mater- 
jals and processes, The fundamental theme of defect prevention remains 
the same in all areas, however, which invariably involves the determin- 

tion of the cause and effect relationship with resultant removal or con- 


The organizational structure which is currently employed or is in 
the process of being established consists of two essentially parallel 
organizations, One is Inspection under a General Chief Inspector and 
the other is a Quality Engineering Group under a Supervisor of Quality 
Engineering, each of which reports to the Quality Assurance Manager who 
in turn reports to the Chief Manufacturing Executive. The Inspection 
section is in the process of being organized along three basic lines; 
Receiving, In Process, and Final Car Inspection. The Quality Engineering 
Group is organized similarly along the lines of Receiving and In Process 
with the third area being Quality Costs, 


The functional reporting relationship would then look like the fol- 
lowing chart: (Figure )) 
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Inspection is basically the data gathering and fact finding agency 
as well as carrying on the routine inspection functions, The Quality 
Engineering Group is the planning, data analyzing and special study 
group. ‘There is considerable effort axerted to prevent either of these 
groups from becoming overly dominant or subservient of the other but in- 
stead to have each of their efforts complement and augment the other's, 





I might mention at this point that the Quality Engineering Group is 
an outgrowth or an expansion of a Statistical Quality Control Department 
we have had for a number of years, By reinforcing this area with more 
highly skilled or trained people but seasoned with older men with broad 
experience, we seem to have arrived at the best combination to make the 
group most effective and respected. One serious hasard I believe has 
been avoided, that is to have the Statistical Quality Control Depart- 
ment looked upon by the remainder of the organization as a slot in the 
wall through which you pass a group of data and after waiting a reason- 
able length of time the data somewhat regrouped and analyzed comes back 
through the slot for whatever use anyone wishes to put it. I believe we 
have also reduced the possibility of another facet of the same hazard 
occurring, and that is to have the Quality Control people become too 
intrigued with techniques and systems for technique's and system's sake 
that the sight of the objective which we are all after is entirely lost, 
We are continually trying to create the feeling and spirit in all people 
concerned that the only justification of the existence of Quality Control 
people is, through the use of their training and tools at their disposal, 
to help production build a product at an acceptable quality level at a 
competitive cost. 


Briefly, the duties and responsibilities of the various areas of 
Inspection and Quality Engineering are as follows: 


Inspection 


Receiving Inspection is divided into six major areas which is prim 
arily dictated by plant layout, These areas are pretty well devoted to 
a particular type of commodity received such as: engine and chassis 
parts, chassis electrical, body stampings and sheet metal, hard trim and 
hardware, body electrical and soft trim, Each of these areas is charged 
with the responsibility of carrying out a pre-determined inspection and 
sampling plan, maintaining performance records by part number and vendor 
name, In addition they mst coordinate their efforts with the Material 
Department in order to see to it that material is moved promtly through 
Receiving and Inspection to the lines or returned to the vendor, sorted, 
repaired or reworked as the particular case may dictate. ‘They mst keep 
vendors and our Production Control Department informed with up-to-the- 
minute information, 


Receiving is a tremendously big job and mst move very quickly. 
Some idea of the complexity of this job is evident from the facts that 
at the Chrysler Division alone, approximately 8500 different parts are 
received and processed, During a normal day approximately 1000 lots or 
shipments are received, This is roughly equivalent to 350 trucks and 28 
freight cars filled with 3 million pounds of mterial, the dollar value 
of which is approximately two million dollars, In the interest of con- 
trolling the investment in inventory, the average days' supply of mter- 
ial on hand is approximately five days, On many items, particularly the 
costly ones, the supply is often measured in terms of hours rather than 
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days. 


At the present time the Receiving Inspection areas make use of the 
to 


- MIL STD 105A sampling tables. Eventually we plan to expand this full 
- usage. 


Production Control and the vendor are notified immediately of a re- 
Ls jected shipment. It is usually practical, for the burden of replacing 
at the material, repairing, » or sorting of the defective material 
is placed on the vendor, Quality information at Receiving is also pro- 
cessed through IBM machines and reported to Quality Engineering where 
monthly vendor quality performance trends are compiled and reported to 
Purchasing and other interested agencies for guiding our vendor relation- 
ships. 


- The In Process Inspection is responsible for all inspection through- 

; out the plant. This includes the process inspectors working to a pre- 

i€ determined sampling plan for obtaining information which is posted on 
charts in each line foreman's area as well as inspection at buy-out 
points throughout the lines, Here again, the information obtained is 

} reported back to Quality Engineering where it is analysed and appropriate 

be reports are issued, 


rol The Final Car Inspection is a hundred per cent inspection operation 

il, on each car after all of the work and previous inspections are complete. 
The purpose of this operation is to buy each car for the customer as well 
as providing a good audit of outgoing quality. Hourly and daily reports 
and commnications are issued to all areas of the plant, including Inspeo- 
tion and Quality Engineering on up-to-the-minute quality problems, 


Quality Engineering 


As mentioned earlier, the Quality Engineering function is broken 
down into three functions; namely — Receiving, In Process and Quality 
Costs, with an engineer in charge of each. 


a 


The Receiving Engineering function is oriented between product 
engineering, purchasing, inspection and production. It is responsible 
for designing and helping to implement appropriate Receiving Inspection 
procedures and plans, analyzing data obtained from Inspection and issu- 
ing appropriate reports, counciling product engineering, purchasing and 
vendors on vendor quality problems and designing and conducting any 
special test which may be required. 


co~ pr 5&5 BE 


The In Process Engineering function is oriented between product 
engineering, master mechanics or tool engineering, inspection and pro= 
duction, The responsibilities of this area are essentially the same as 


those for Receiving Engineering except as they would apply to the In 
Process activities, 


; The third and most recently added Quality Engineering function is 

8 that of Quality Costs, A definite need was felt for having a focal 

) point for the analysis and reporting of the various factors which go to 

; make up the cost of quality such as; warranty, scrap, rework and repairs 
- inspection costs, etc. This function is oriented between service, product 
8 engineering, inspection, production and various financial agencies, We 

’ have great hopes for better guidance in our efforts through a better or- 
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ganized approach to quality costs, 





In addition to the routine analysis and reporting of quality infor- 
mation and the conducting of special studies, the Quality Engineering 
Group also conducts weekly quality meetings with each General Superin- 
tendent and his supervision, Present at these meetings are all of the 
service agencies such as product engineering, tool engineering and inspeo- 
tion; in order to bring their efforts to bear on quality problems which 
are encountered by the General Superintendent involved. These meetings 
have been very fruitful in the solution of quality problems and have 
done a lot to sell quality control to production supervision by demonstra 
ting at the local level how it can be of help to then, 





Key to Success of Program 





In our experience we have had it demonstrated so vividly so many 
times that the key to success of a quality control program can be summed 
up in one word attitudes, Proper attitudes are so all important 
and the one ingredient without which no program can be a success, In 
the final analysis, isn't this true of about any undertaking? 





The attitude of Top Management towards product quality, responsibi- 
lities for it and decisions which influence it is all important. I ven- 
ture to say that there are very few company executives who would not nod 
their head yes when asked if they are for product quality or product 
quality improvement at reduced cost through quality control. It is often 
quite easy to overlook the fact, however, that product quality is affect- 
ed by practically every decision and plan from the product planning and 
design stage through all of the remaining stages of plant layout, tool- 
ing, procurement, manning, production, inspection and shipment until it 
arrives in the customer's hands ready for use. It is just as easy to 
overlook the fact that product quality then becomes an integral part of 
the responsibility of every area of the organization required to carry 
out these functions, which is practically everyone. 


The attitude of manufacturing or production management cannot be 
over-emphasized., This is perhaps the most important attitude from the 
workmanship phase of product quality, It's amazing how the manufactur- 
ing management's attitude can permeate the entire production organiza- 
tion down to the lowest man, It follows from this, that a healthy atti- 
tude by Production supervision and a willing acceptance of product qual- 
ity responsibilities will mean a great deal at the grass-roots level --- 
the operator himself, 


Of course we can't overlook the attitudes of the quality people 
themselves, As I mentioned earlier, we are contimally trying to create 
the attitude in all of the quality people that the main reason for their 
existence is to aid every phase of manufacturing to carry out their pro- 
duct quality responsibilities. It is not good enough to point out that 
quality is production's responsibility and then do no more than criticize 
for this reeponsibility not being carried out properly. ‘The quality peo- 
ple mst be willing to mix right into the production ani quality problems, 
objectively analyze the problems, fairly place responsibilities, ask for 
corrections, and help provide a vigorous follow-up. We have found this 
has a chain reaction effect which makes production feel as though some- 
one cares about their quality problems and is willing to help them, in 
turn creating a proper climate for a healthy quality attitude and thus 
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reducing your quality problems in both intensity and mumber, As you 
know, the production people, of necessity, are a hard headed practical 
lot directly faced with the daily quality problems are well as many 
others, If they can see some concrete evidence of help in solving their 
quality problems, their full cooperation is usually easily obtained, 


One of the best examples of this in our organization is a works 
manager who is of the old school "bull of the woods" type. When it was 
first attempted to install a quality control system in his area he was 
not too pleased and as a matter of fact, fought it very strenuously. 
Finally through the quality people getting into some of his toughest 
quality problems, conducting machine and process capability studies, 
giving him good information on scrap and quality performance of his 
supervision, he became convinced that the quality people were giving him 
help in highlighting the quality elements which were within his control 
and making sound recommendations as to how to control them, He was also 
impressed with the way the quality people went after his quality pro- 
blems which were beyond his control, As a result, he became one of the 
dominant figures in the program; the results of which have been excep- 
tionally good, as demonstrated by the following charts of per cent defec- 
tive vs time, as examples, (Figure 5). 


The kind of results exemplified by these curves are brought about 
by close working team-work of production, Quality Control and other tech- 
nical service agencies, These curves also demonstrate to me that Quality 
Control has very little glamour connected with it or spectacular things 
happening as a result of it. Instead, it is a day to day, constantly 
plugging type of job contimally looking ahead at the long term outlook. 


The other day, the works manager mentioned above, stopped in my 
office to discuss a few problems, At this time he commented that if he 
were to set up a new operation, he would install exactly the same system, 
Coming from him, this was one of the highest compliments which could be 
paid the program and exemplified the spirit or attitude which has con- 
tributed to its success, 
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THE MANUFACTURING PROGRESS FUNCTION 
Richard Conway and Andrew Schultz, Jr. 
Department of Industrial and Engineering Administration 
Cornell University 


I. INTRODUCTION AND BACKGROUND 





A. Objective of Paper 
B. Manufacturing Progress Defined 
C. Measurements of Progress 
D. Previous Use in Aircraft Industry 
1. General shape 
2. Mathematical description - various bases 
3. Logical basis 
4. Inconsistencies in its use 


II. CAUSE AND EFFECT - LOGICAL RELATIONSHIPS 
A. Pre-production Factors 
1. Tooling, Product Design, Process Design, Methods, 
Organization, etc. 
B. During Production 
1. Operator, Methods, Design and Operation Changes, 
Engineering Effort, Volume Changes 


III. EMPIRICAL STUDIES 


A. Examples 
B. Inferences 
l. Fits 


2. Prediction 


C. Areas of Application 
1. Firms 
2. Products 


D. Problems of Application 

Parameters 

Identification of factors - statistical relations 
Aggregation 

Classification 

Points of estimation 
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IV. CONCLUSIONS 
A. Applicability 


B. Reliability 
C. Further Study 
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APPLICATION OF THE IBM 650 MDDPM 
TO THE MANUFACTURING PROGRESS FUNCTION 


William J. Halladay, Jr. 
Project Engineer 
International Business Machines 


A VERY BRIEF DESCRIPTION OF THE 650 MAGNETIC DRUM DATA PROCESSING 
MACHINE. 


The 650 is defined as a digital electronic computer, the 
basic units are described and the two primary types of program- 
ming are identified. 


THE USE OF THE 650 MDDPM TO CALCULATE TABLES FOR THE APPLICATION 
OF THE MANUFACTURING PROGRESS FUNCTION. 


The 650 is programmed to calculate two tables: (1) the x° 
tables where values of X from 2 to 32,000 are raised to fourteen 
different b power values, and (2) the E table where a constant E 
is calculated for the same values of X and b_as in the xD table. 
The X” values are used in the basic formula Y = a/X° and the E 
table is used to calculate M.P.F. hours for given lot sizes. 


THE USE OF THE 650 MDDPM TO CALCULATE AND EVALUATE THE SLOPE OF 
THE MANUFACTURING PROGRESS FUNCTION. 


Three integrated 650 programs are used to calculate and eval- 
uate the slope of the M.P.F. Program #1 is a Basic Machine Lan- 
guage program designed to compile costs per lot by part number 
into cost per cumulative unit by unit or machine number. Program 
#2 is a "FLOPS" program to consolidate all cost coordinates (from 
Program #2) into cost coordinates with significant cost differ- 
ences between units of cumulative production. Program #3 is an- 
other "FLOPS" program to analyze the results of Program #2. Anal- 
ysis is in terms of a Least Squares Trend Line, upper and lower 
control limits and the regression coefficient. 
























THE MANUFACTURING PROGRESS FUNCTION, 
ITS APPLICATION TO OPERATIONS AT IBM, ENDICOTT 


Donald A. Schreiner 
Technical Engineer 
International Business Machines 


I. APPLICATION OF THE MANUFACTURING PROGRESS FUNCTION 


The following application of the Manufacturing Progress 
Function will be discussed: 


A. Cost of introducing a new machine or a change to a 
machine. 


B. Prediction of manpower and space requirements. 
C. Budgeting of special engineering and staff efforts. 


II. THE TECHNIQUES USED IN MAKING A MANUFACTURING PROGRESS FUNCTION 
STUDY. 


The techniques used for preparing estimates, selecting 
"ultimete" units of production, and "progress slopes" will be 
discussed in detail. 


III. THE RESULTS OF A MANUFACTURING PROGRESS FUNCTION STUDY. 
A discussion of how management is aided by use of the Manu- 


facturing Progress Function with regard to personnel transfers, 
cost analysis, and planning. 
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AN EXAMPLE IN STATISTICAL PLANNING OF LABORATORY EXPERIMENTS 


Or 


THE ENGINEER AND STATISTICIAN CAN MEET 


A Skit 
in Three Acts 


By 


The Engineer F. R. Del Priore 
The Statistician B. B. Day 


Adapted from 
"The Statistics in a Gear-Test Pregram" 
By B. B. Day and F. R. Del Priere 
U. S. Naval Engineering Experiment Statien 
Annapelis, Maryland 
INDUSTRIAL QUALITY CONTROL, Vol. IX Ne. 5, 
March 1953 
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Engineer's Office 
Stat: Goed morning, Mr. Del Priore. 


: Goed morning, Miss Day. Whom did you come toe bother this time? 
How about a cup ef coffee? 


Stat: No, thank you, I'v had my cup. You happen to be the victim. 
I've been thinking about that problem you were discussing at lunch 
last Wednesday. Perhaps I can help you. Would you be willing to go 
into the details more completely this morning? Have you the time? 


Engr: Oh, I always have time to talk about my baby, but I don't see 
how yeu can help. I know you have helped Mike and larry, but ay 
problem is different. 





Stat: Where have I heard that before? 


: And, besides, the test has not been run yet. Wait until I get 
the data, then I will be glad to have some help. In fact, I will need 
help because we are going to have reams of it. 


Stat: Believe it or net, it's befere you get the data that we can be 
of mest help; I mean in planning the test schedule. 


Engr: Oh. Besides, we don't have any money in the job order for your 
services. 


Stat: That's semething you don't have to worry about, fartunately. 

The Bureau of Ships prevides a special fund te run our effice, so we 
den't have to charge our time to the test set-up. Our services are 

free!! 





: That is fine. Well then, perhaps yeu would like to see the 
test set-up. 


Stat: I certainly do want te, but I would like to get some more back- 
ground information first. It would then mean more te me. What, 
definitely, is the purpose of this test? 





Engr: The long-range pregram - everall problem - the Navy mst reduce 
the noise generated by our submarines. With the increase in efficiency 
of sonar, this is an important problem. Our phase is te reduce the 
noise from the power transmission gears, particularly the reduction 
gears. This will include testing different gear geometry,materials, 
finishes, etc. That is the leng-range problem. 


Stat: Is this something new? This kind of testing? 

Engr: Many aspects ef it are new. The immediate preblem is to set up 
test cenditions for later testing different gears, etc., in the long- 
range preblem. This is the problem then. 


Stat: I see. 


























Engr: And the real reason is that we simply have to cut down on all the 
possible test conditions that we could run. 


Stat: Just what are the factors that you are considering? 


Engr: May I use my blackboard? I always talk better when I use the 
blackboard. 


Stat: Oh, please do. 


: The most important factor is direction of introducing the lubri- 
cant. 


Blackboard 
top 


¥ 


otton 

Engr: Here is the gear system and we can introduce the lubricant from 
the top or from the bottom. We can introduce it in mesh with the 
system or tangent. 


Stat: I see, there are then four possible directions. 


Engr: And we have designed a new nozzle to do this. Another factor is 
speed. With this set-up here we are thinking of testing two speeds - 
300 and 1200 rpm. Another factor is load, and we are thinking of test- 
ing 25 per cent of full load and 125 per cent. The fourth factor is 
volume - this is the amount of the lubricant, and we are thinking of 
1/2 gallon per minute, 1, 1-1/2, and 2 gallons per minute. 


Stat: Are these all - direction, speed, load, and volume? 


Blackboard 
Direction: top inmesh, top tangent, bottom 
inmesh, bottom tangent 
Speed: 300 and 1200 rpm 
Load: 25%, 105% 
Volume: 1/2, 1, 1-1/2, 2 gpm 
Temperature: 90°, 100°, 160°F 


Engr: Oh, wait a mimute! There is one more - temperature, and we are 
thinking of 902 100°, 160° Fahrenheit. Now, these are the factors that 
we think are important. 


Stat: Which is the most important? Direction? 


Engr: Yes. 


Stat: I see. 


Engr: This is going to be run with the standard gears? 
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What about the lubricant? 
r:; 2190-T, standard. 
Stat: Oh, ere you going to keep this constant for the test? 


Engr: Well, that is what the Navy is using all the time - most of the 
time. 


Stat: That sounds good. There are certain things now that we mst 
know to be able to plan efficiently. I'm going to ask a really diffi- 
cult question: Do any of these factors act jointly; for instance, 
speed and load? Suppose we plotted our load along here, and we ran a 
curve for each speed. Let's see, you're going to measure this in 
decibels, aren't you? 


xr; Decibels, yeah. You straightened us out on that. 


Stat: Good. Suppose you plotted decibels along the vertical axis and 
you you ran it at 300 rpm. You would get a certain noise level at the low 
load and you get another level at the high load. At the 1200 speed 

at the low load, it may be high and at the high load it may be low. 

Just the opposite effect. The noise was less. We would say then in the 
statistical language, there is an interaction effect between speed and 
load on noise. 


Blackboard (30° “en 
gad 11.00 Ypm 


me 


/ bead "Lead ia 
Engr: I don't know what you mean. 








Stat: Well, let's try again. Suppose we had a certain curve for this 
300 5; speed and then, for the 1200 speed, it was only higher, but with 
the same slope. The two lines are pretty mich parallel. Then we would 
say there is no interaction of speed and load on noise. Dr. Youden has 
explained it in a rather neat way. Your ten-year old boy Jimmy makes 
a certain amount of noise when he plays in his backyard. Your neighbor 
has an eight-year old boy who also makes a certain amount of noise in 
his backyard. Put Jimmy and the neighbor's boy together, and what will 
you get? 


Engr: A racket! 

Stat: Then, he carried it a little farther. Suppose you had a 15-year 
Old daughter and your neighbor had a 16-year old son. Each one in his 
own living roem in the evening would make a certain amount of noise, 
but put them together in the same room...... 

Engr: It would be awfully quiet! 


Stat: That is what we mean by interaction effect. 
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Engr: Suppose you come back to the second graph. 
Stat: All right. 


Engr: If one speed curve is, say, a little bit higher than the other 
speed curve but the same slope, then you say it's no interaction? 


Stat: That's right. 

Engr: Now, it mst then be a straight line? 

Stat: Oh, no, it might take any shape. Suppose we had three loads. 
The noise effect might show a curved line; but, if the curves are 
pretty mich the same shape, then we would say there is no interaction. 


Engr: I see, it is the same family of curves. Now this means that 
there is no interaction. 


Stat: That's right. 
xr: And you call the first plot: speed and load interact? 
Stat: The interaction might take different forms. For instance, if at 


one speed noise increased faster with an increase in load, we would 
have an interaction effect. 


Engr: Okay. 


Stat: Now, let's examine these factors in the light of what we've 
just said. How about direction with these other factors. Does it 
interact with any of them? 








: Well, so far as direction is concerned, it is new. It is a new 
factor and we don't know anything about it. 


Stat: I see. How about speed? 

Engr: Speed is tricky, Miss Day. We have studied speed before and we 
have found that it, as you would say, interacts. I guess I am using 
the word correctly. It interacts with these other factors more or less, 
sometimes; so I don't know. 

Stat: Not so good. How about the other three. 

Engr: In our previous studies, we have found that load and volume and 
temperature too are more or less additive to each other. Does that 
mean they do not interact? 

Stat: That's right. 


Engr: Additive. If they are additive, that means there is no inter- 
action. 


Stat: That is right. 


Engr: Then we can say load and volume and temperature do not interact. 
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Stat: Now, let's see what we have here. You have four volumes. Is 
that so terrifically important that you need four levels? 


No, we just wanted to cover the range from 1/2 to 2. 


E 


Stat: Would three be sufficient? 





All right. 


a: 


Well, if you left out one, which would you leave out? 





r: Oh, I think the 1-1/2. 
Stat: Good! Now that I have given you something there, how about you 
giving me another load. Wouldn't it be a goed idea to have three 
loads? 


I guess it would be. How about full load, 10047 


E 


Stat: Good! We have four directions, two speeds - that will make 8 
combinations - three loads - that will be 2 - three volumes - that'll 
be 72 - and three temperatures - that'll make 216 combinations 
altogether. 
Blackboard 

khx2x3x3x 3 = 216 
Engr: What! 216 combinations? 
Stat: That's what you have there. 


: And that is without running it three or four or five times? 
We can't do that, Miss Day. 


Stat: Don't get excited; give us a chance. Maybe we can do better 
than that. 


Engr: I hope so. 

Stat: We have covered the purpose, the factors, and their levels and 
interaction. There is one more matter, I think of now, to discuss. 
How easy is it to change direction? 

Engr: Oh, direction is easy, just a flip of the switch. 

Stat: I see, and speed? 

Engr: Same thing. 

Stat: How about load? 

Engr: Load takes sbout an hour to change. Volume takes five minutes. 
Stat: And temperature? You haven't said anything about that. 


Engr: Temperature. We can't change temperature within a day. We have 
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to heat up the whole tank of oil or cool it down and we can't do it 
within a short space of time. I would say it takes about four or five 
hours to change temperature. 


Stat: Well, it is good to know about this. Suppose you had a certain 
temperature and load and volume. How many could you do in a day of the 
speed and direction combination? 


: Oh, well, we calibrate every morning. Let's see, we can perhaps 
get eight or ten different runs changing speed and direction if we keep 
the others fixed - constant. 


Stat: That's not toe bad. I teslieve we can work something out. 
poet Oh, and there is one thing more I forgot to mention. Ome of 

e reasons for running different tests is to decide where to take the 
noise measurements. In this test, we are going to measure the noise at 
five different locations. 
Stat: Only five! 


: And then we are going to take tooth contact frequency and also 
the overall noise. 


Stat: Can these all be taken at one time? 

: When we have the gear system running, we take all of these at 
one time. The pick-ups are there. It is on tapes. We don't have to 
worry about that. 


Stat: Well, that won't be any problem then. It just means more data 
to analyze. 





Engr: But that is one of the reasons for running the tests. We hope 
to be able to cut down on the number of readings in the future. 


Stat: I believe I have the picture now. 


: You say we have 216 combinations. We just can't run them over 
and over and over again, Miss Day. 


Stat: Well, suppose we don't worry about that until I have time to 
think this over a little and see what we can come up with. Shall we 
take a look at your equipment? 


Engr: I will be glad to show it to you. You know, this time has not 
been wasted. Just talking about the problem has helped. If you need 
any more information, give me a ring. 


Stat: Thank you, I may come back for more. It certainly is an inter- 
esting problen. 


Engr: I am glad you think so. Let's go to the laboratory. 
Stat: O. K. 


END OF ACT I 








ACT II 
































Engineer's office - several days later. 
Stat: How does 144 or even as low as 72 runs sound for your problem? 


: Oh, 72 instead of 216. That sounds real interesting. Now 
you're talking up my alley. Proceed! 


Stat: I'11 put it on the board in the form of a little diagran. 











Blackboard 
Volume 
Load 1/2 1 2 
25% 120° go° 160° 
100% 90° 160° 120° 
125% 160° 120° 90° 

















Stat: Suppose we put volume across the top. We had three volumes - 
172, 1, and 2; and load over here on the left side. We had three loads 
- 25%, 100%, and 125%. Now let's put our temperature in these cells 
and we'll put one each in each row and in each colum. Starting with 
90° in the first row and second column; then we put it in the second 
row and the first colum, and finally in the third row and third 
colum. Now, let's see what this means. In the first case we'll run at 
25% load, 1 gpm volume, and 90° temperature. 


Engr: Is that what it means? 


Stat: Yes. Also we'd run the 125% load, 2 gpm, and 90°. Now let's 
follow through in the same manner for the other two temperatures which 
will completely fill our nine cells. Thus we now have nine different 
combinations. If we have made all the possible combinations of load, 
volume and temperature, we would have had 27. But here we have reduced 
it to nine. 





Engr: Yeah! We have to run the other 16. 


Stat: No, not for the present. Let's just think about these nine. 
Now, let's see what we'll get out of it. Suppose we toek the measure- 
ments for these nine different combinations and they were put in the 
nine cells. 





Engr: You mean, after the test has been run? 
Stat: Yes. You will note that, if you added across and got the total 


for the first row and the total for the second row, they would be 
balanced for volume and they would be also balanced fer temperature. 
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Engr: Wait a mimte, I will have to look at that. (Pause) You are 
right. 


Stat: So that the difference in these totals, aside from the error in 
running the tests, will measure the difference between using a 25% 

load and a 100% load. Likewise, if you took the third load and compar- 
ed it with the other two because they would be balanced throughout. Now 
we'll write that down. Load - we have two comparisons there, that is 
two independent comparisons. We can compare our 25 against the 100 and 
we can compare the two against the third. Now let's loek at the 

column totals which stand for the three volumes. Suppose we toek the 
first volume, the 1/2 gpm, and last colum, the 2 gpm. You will note 
they are balanced for beth temperature load. So the difference in the 
totals over and above the errors in the test will be due te the differ- 
ence in the two volumes. Do you see that? 


Engr: I see. It is balanced. 


Stat: Yes, well, then if you take the high against the low, then the 
other would be the twe ends against the middle. Se you would have two 
comparisons for volume. The next step is a little more difficult to 
see. But leek at our temperatures. 


r: You mean to say that temperature is balanced too? 
Stat: Well, didn't we put it in a balanced way in the beginning. 


Engr: Well, let's see. (Werking on board) 90° here lead 1; 90° here 
load volume 1/2; 90° here volume 2 load 125, 120, 120, 120, 160, 160, 
160. It is balanced as ef the temperature. 


Stat: Se, we'll put in two comparisons fer temperature. We have nine 
different runs and there will be eight comparisons. That is one thing 
you have te take on faith; adding these up and subtracting from the 
eight, leaves two fer the remainder, that is for measurement of the 
errer in the test. 


: Error? As I see it here, yeu only have nine combinations out of . 
the twenty-seven. We are not running any of these over again. It seems 
te me we are getting something for nothing. 


Stat: Well, net exactly. Yeu remember last Menday when we discussed 
this you assured me there were no interaction effects of volume, lead, 
and temperature. It was fer that reasen we could use this plan. We 
call this the “latin Square". The "ne interaction” statement made 
that pessible. 


Engr: Latin Square? Wait until I tell my Senator about this. Well, 
are we going te run the ether parts of that twenty-seven? 


Stat: Not unless we find we need te. We'll run these first and see 
hew well the data come out. 


: Oh! But wait a mimte, you have fergotten the mest impertant 
problem in the test; speed and direction. I mean, that is the reasen 

















for running the test. 


Stat: I haven't forgotten them. I'm just coming to that part. You 
had four directions and two speeds, or a total of eight combinations 
with speed and direction. 


Engr: Let's see, we have 300 and 1200 top inmesh, 300 and 1200 top 
tangent, 300 and 1200 bottom inmesh, 300 and 1200 bottom tangent - 8. 


Stat: You're doing fine. Now you also said that you could run 8 or 
10 in a day. As you will see in the drawing, what I have done has 
been to arrange to run 8 combinations of speed and direction for a 
particular volume, load and temperature in one day. Can they do that? 


: You mean we set the volume, load and temperature and then 
change the speed and direction? yes, we can run eight. 


Stat: Fine, so this will represent altogether 9 days of work. 


: Right - nine days. Oh, and in this way you take care of 
temperature. You remember we couldn't change temperature in a day? 
I was wondering what you were going to do about temperature. 


Stat: That's right. Well, now, let's see what comparisons we get 
with this in the same way as before. We will call the part above the 
main part, the comparisons for speed and direction we call the subpart. 
This is a case where the sub-part is more important than the main part. 


Engr: It seems to me like a backward way to do it this way, Miss Day. 
Stat: Yes, it does. 
Engr: Is this the way all statisticians work? 


Stat: I guess so. If we have 2 speeds, we can only compare one speed 
against the other, so we have one comparison for speed, and a very 
goed comparison toe for we will have four of each speed in each one ef 
the nine days. Direction - we'll have three comparisons there. We 
have top against bottom; inmesh against tangential; and then the inter- 
action of the two. What we also have is the interaction effect of 
speed and direction. Remember, you said you wanted te find out this. 
The neat part about this plan is that we can find the interactien of 
speed with each of the three above also - load, volume and temperature. 
Also, the interaction of direction with each ef them. We will have a 
total of 72 readings. 


Engr: Nine days, eight a day -- seventy-two, right. 

Stat: So we'll have 71 comparisons, and we'll have a remainder here, 
or errer term, of 32. You may be interested to know that this sub-part 
is called "split plot”. 

Engr: I thought it was Latin Square? 

Stat: Well, the whole thing is a "latin Square Split Plet" design. 


Engr: Latin Squére split-plot - wait until I tell Mike about this. 
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Stat: Now, the valuable thing about this is that we'll have a very 
good measure of the effect of speed and direction, alse a& good measure 
of the within-day variability or error. Is that all clear? 


Engr: Well, frankly, there is a lot about this I don't understand, 
but I'll try anything once. Suppese I copy down these combinations 
and we will start running. 


Stat: Wait a minute. There's one thing more that I saved until the 
last. And that is we want to make sure that no systematic changes 
in your machinery gets inte the test. So, what I had done is to 
randomize the order in which these nine days will be run. 


Engr: Randomize? 

Stat: Yes, I've actually left it te chance which one would come first, 
second, etc. In the drawing here, the first ene te be run is the one 
with the 1 gpm, the 25% lead, and the 90°. Do you see any objections to 
this? 


: No. We have to set it at something every morning. We can set 
it at anything you prefer. 


Stat: Fine. 


Engr: Except, I was going to test all the 90°'s first, then the 120°'s 
and then the 160°'s. It is the way we always do. 


Stat: This is better for it is insurance against some of the system- 
atic effects getting into the readings. 


Engr: Okay, and we will set them at whatever you prefer. We can fun 
all the 300 rpm's in the morning and all the 1200's in the afternoen. 


Stat: Well, now, wouldn't it be a goed idea to be just as careful with 
the within-days runs to avoid systematic changes we call bias there? 


Engr: Well, we know our equipment changes during the day. We 
calibrate every morning and we know it dees not stay on calibration 
all day long. But I don't see how you are going to take care of that. 


Stat: Well, we will do it in the same way by leaving to chance the 
order in which they are run. And I believe you told me in the 
beginning that it was very easy to change from one speed and direction 
to anether. 


Engr: Yes, but what is wrong with doing it the way we have always 
been doing it? We will run the 300's in the morning and 1200's in 
the afternoen. What's wrong with that? It's a nice break. 


Stat: Yes, but if there's a change in drift, that drift might coincide 
just with your changes in speed. I have prepared a random order of runs 
for the first day. Would you take a loek at it and see if it can be 
done without toe mich upsetting of the works? (Handing first day's 
schedule to Engineer. ) 





Engr: I see you mixed up the speeds and directions. You randomized? 


Stat: That's right. You're a very apt pupil. We'll furnish a differat 
order for each test run. They're in the process of being typed up now. 
We have set it up in a way I believe the technicians can follow. Will 
you see if it needs any revision? 


: This part up above is what we set and then during the day we 
change the directions and the speed and we will follow this order. 


Stat: That's right. And then there'll be another one for the next 
test run. 





r: A different order? 


Stat: Yes. And this is what we call randomization, leaving it to 
chance. 





Engr: This is new to me, Miss Day. This is not the way we usually do 
it. Let's spend a minute on it. I would like to be clear as to why. 
If there were a drift and we always ran the 300 speeds first, the 300 
speeds would be a little lower than they Would be because of the drift? 
Stat: That's right. Or a little higher. 

Engr: Or a little higher. And the same with the 1200's? 

Stat: That's right. 


Engr: Oh, I see, so you mixed them up like this, so in the end the 
picture will be as it should be. Okay. 


Stat: Yes. Both speeds have an equal chance of being affected by the 
drift? 





Engr: I hope that you are not going to tell us we should run this over 
five or six or ten times. 


Stat: Oh, no, let's run it this once and analyze the data and see what 
we get from it. It may be necessary to make a second run, but again it 
may not. 


Engr: This is quite different than what I had in mind, frankly. We 
will see how it comes out. 


Stat: Fine, thank you. That is all anyone could ask - to give it a 
trial. 
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Figure 2- 3 x3 Latin Square - Split Plot Design 
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TEST SCHEDULE 


First day of 9 days 
Preliminary Gear Test for Optimum Operating Conditions 


First Day 
Main Setting 
Load 25% 
Volume 1.0 gal/min. 
Temperature 90° 
Order of Speed 
testing Direction (epm) 
(1) Tangential - bottom 300 
2) Inmesh - top 1200 
3) Inmesh - bottom 300 
4) Inmesh - top 300 
5) Tangential - top 1200 
33 Tangential - tep 300 
7) Tangential - bottom 1200 
(8) Inmesh - bottom 1200 





ACT III 
Statistician's office (three weeks later) 
Engr: Hello, Miss Day. 
Stat: Helle, come in. 


: It's been a couple of days since you had the data and I wonder 
if you have it analyzed. 


Stat: Well, now, you do expect fast work. We are goed, but not that 
goed. However, we do have the first one of the sets in shape. 





: Okay, is that the tooth contact frequency at the first location - 
the one I thought was most important? 


Stat: Yes, that's right (handing set of tables to engineer). 
Engr: You mean this table here? 


Stat: In Table 1, we have the data unshuffled, as you might say, or in 
the order of the original chart. And also we have the totals there. 
You will recall we talked about the totals cf the rows; for instance, 
for 25% load, we have a total of 2063; for the 100%, a total of 2057. 


Engr: Are these the figures on the last colum here? 
Stat: Yes, and for the 125%, 2098. 
Engr: How can you make any sense out of these data? 


Stat: We have a very powerful teel in Statistics called the analysis 
of vai variance, whereby we can break down the total variation among all 
these readings. If you will notice, they range from 61 up to 95.9. 
Just as you have measure for noise or length, the statistician has a 
means for variation. So we can break down the total variation and get 
@ measure of the effect (now, that's a new word), that is, the amount 
ef variation that is due to a change in load er volume or speed or 
direction or combinations of those. I know that is pretty difficult 
to fellow. 


Engr: Well, so far it is just words. 


Stat: Well, let's take a loek at the table and see if that will help 
out. Look at Table 2. That's the table with direction across the top 
and speed along the side. Now let's look at the totals for directions. 
They are 1549, 1555, 1570, and 1543. They are all pretty close to- 
gether, aren't they? 


Engr: Yes. They seem to be. 
Stat: Well, we can actually, by a little arithmetic, measure the effect 


of th the differences, that is, how important they actually are. Now, leek 
at the twe speeds; 300 rpm is roughly 3200, and 1200 is roughly 3000. 
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Quite a lot of difference, isn't there? We will also measure the 
importance or significance of that difference. 


Engr: Can you show me physically? 


Stat: We have done the arithmetic for this breakdown. It appears in 
Table 4. If you will notice, the first colum of this table are those 
comparisons we talked about the other day; only we call them degrees of 
freedom. You don't mind, do you? 


Engr: I don't know what they mean anyway, so it doesn't make any 
difference what you call them. 


Stat: Then we have, as I said, broken down the total variation into 
the part due to each factor and their interactions. First, we got the 
total variation by finding the average of all the readings, then sub- 
tracting it from each of the 72 readings, squaring each of these 72 
differences, and finally summing the squares. That is the total found 
at the foet of the second colum. By some more arithmetic, we broke 
this dewn into the part due to each of the items in the left side head- 
ing. Next we get the average effect of each one in the last colum by 
dividing each by its corresponding degrees of freedom. 


r: I hope you don't expect me to get all this. 
Stat: Well, not all at once. New look at the last item in your table, 
that 15.75. If you toek the square roet of that, you would have the 
standard deviation - that is familiar to you, isn't it? - and it would 
be in decibels, approximately 4.0 decibels. 


Engr: I have been reading abeut standard deviation in journals. It 
seems that you statisticians are getting in everything these days. 


Stat: We're trying to, anyway. We're infiltrating. 





Engr: And that is a standard deviation? And that is what is used in 
nermal curves? 


Stat: That's right. That is the error tern. 





Engr: Did you say that was in decibels? 
Stat: Decibels, yes. 


Engr: Se that applies to this test physically? And it is a quantitive 
measure. It is quantitive. 


Stat: That's right. The 15.75 is a measure of the within-day variation 
or error. Now let's look up the other error term in the main part - 
that is 8.225 - that measures the day-to-day error. 

Engr: Well, that is smaller than the other. 


Stat: Only slightly so. 








Engr: Do these things become more important as they become larger or 
smaller? 


Stat: Oh, as they become larger. The smaller they are, the more 
sensitive is your test. 





Engr: So this means that our day to day variation. . . 
Stat: .. . is no greater than the within day variation. 


Engr: Well, that is quite interesting because we are spending a let of 
time calibrating the equipment every morning; and this means that at 
least fer these two weeks we have the calibration under control? 


Stat: Exactly. Now, we use this error term, the 15.75 te decide 
whether or not a factor in the sub-part is important. If it is no 
larger than that, then that factor is not important in the test. 
We'll start at the bottom and work up. The first three, direction by 
lead, volume and temperature, aren't sufficiently different from the 
15.75 te be considered significant. Neither is direction with speed, 
8.81 impertant. Hence, we can conclude that direction does not 
interact with any ene of the other four factors in the test. 


Engr: That's really interesting information. 
Stat: Now let's leek at direction itself . . with a mean square of 7.75. 


Engr: I gather that direction is not important either, since it is 
that small. Well, wait until I get back te the lab. You know, we had 
@ little argument, friendly, of course, as to this direction factor; 
and this is evidence that it is not impertant. This means that we in 
our test work could just as well only run one direction? Well, goed 
deal! That will cut our test work dew te one-fourth. 


Stat: That's right. And it is very goed evidence tee, because it is 
based on 16 readings of each one of these 4 directions. New, let's loek 
at Sx L. I have prepared a table of speed - load tetals, Table 3. 

If we pletted load along x-axis and decibels on the Y-axis, the totals 
for 300 rpm weuld go down a little at 100% and up slightly at 1°5%. 


Engr: Well, that is about flat, isn't it? 

Stat: Yes, we find 1200 starts at 964 and goes to 990 and then on up to 
1020. Now loek at the measure of that effect; that is, load times 
speed. We get 68.34, which is quite large. We compare it with our 
15.753; that is, we find the ratio of 68.34 te 15.75. 

Engr: How de you decide whether it is large enough? 

Stat: Well, it has all been worked out mathematically, but I don't 
think we need to go into its nature here. This ratie of the number to 
the error term is 

Engr: Ratios? Oh, you use ratios? 


Stat: Yes, we called these ratios, F ratios. Where "F" happens te 
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stand for Fisher, the man who is most responsible for this technique. 
Engr: You mean he is the one to blame? What de you use, formas? 


Stat: No, tables have been worked out which tell us just how strong 
the evidence is that a facter is important. 


Engr: Oh, you use tables. I see. 


Stat: Well, we could ge on... 


of 
Engr: I have an appointment seen, Miss Day. I will tell you, from the 
leeks of this, it seems I can assign the equipment up te some ether 
preject because I don't think we have to run this over again. 
Stat: Well, I don't think so, either. 

r: We seem te have eneugh information; and it is all straight- 
forward, assuming, of course, the other lecatiens give the same answer. 
Stat: We will werk up the rest ef the data and write you a report on it. 
: Well, actually, Miss Day, this has been an experience and I am 

especially interested in this technique of analysis. That strikes me as 
being very pewerful. You knew, I have been avoiding your class in 

T5- Statistics. Perhaps I should sign up fer your class. 
Stat: We would be very happy te have you. It has been a pleasure 
werking with you on this. 
Engr: And I will wait fer your mem. 
Stat: Yes. 

pk 


END. OF ACT III 
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THE QUALITY CONTROL PROGRAM AT 
MINIATURE PRECISION BEARINGS, INC, 


Charles J. Hudson 
Miniature Precision Bearings, Inc. 


Ten years ago when the American Society for Quality Control 
was in its infancy our conversations at meetings centered around intri- 
cate detail, many charts, record keeping by machine operators and 
others and reports issued daily, weekly, and monthly. The mathema- 
tics of quality control procedures including probability calculations 
were emphasized, 


In 1958 simplification is the order -- to minimize our record 
keeping and to retain and circulate only the most important reports. 
We make our quality control efforts simple and understandable parti- 
cularly to the machine operators who are the people who can correct 
plant difficulties causing rejections, reworks and high costs. 


What plant exists where it can be truthfully said that the quality 
control work is completed and nothing remains but routine work? At 
Miniature Precision Bearings, Inc., as in most other plants, there is 
still much to be done to correct difficulties resulting in product losses, 
New problems occur frequently. 


The product manufactured at Miniature Precision Bearings, Inc. 
is instrument and precision ball bearings. About five hundred different 
kinds of bearings are made, classed as radial, thrust, pivot and angu- 
lar contact. The outside diameters of these bearings range from 
3/8ths inch down to .059", The smallest weighs only .02 grams, They 
are used in precision instruments and delicate mechanisms such as 
gyroscopes, recorders, synchros, potentiometers, metering devices, 
etc. where light weight, long life, accuracy and precision, low torque 
and a minimum of space and weight are among the requirements, 


The problems in the manufacture of miniature ball bearings 
center around holding dimensions of inner and outer rings to .0002" 
or less, the sphericity and size of the balls to within a few millionths 
of an inch and surface finishes of critical surfaces to within 1/2 micro- 
inch, Concentricities and parallelism of sides or faces are of prime 
importance, Operating factors such as low torque and low noise re- 
quirements are frequent customer specifications. 


To obtain close tolerances quality control methods are used for 
determining machine capabilities. Because parts are small and must 
be held to narrow limits, machines producing these parts must be held 
very close to, and sometimes better than, the manufacturer's recom- 
mendations; as a result it is necessary to make extensive use of mach- 
ine capability studies. Frequency distribution charts are often relied 
upon and sampling inspection by roving inspectors during the manu- 
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facturing process is prevalent. Roving inspection and operator inspec- 
tion is followed by a system of toll gate inspection where parts are 
examined at the end of individual or a series of related manufacturing 
operations, the "toll'' being compliance with specifications, 








The problems of maintaining close tolerances in the manufacture 
of the tiny components entering into small assembled bearings are 
many. For example, commercial gages for measuring some of the 
requirements are often not available from the usual source of supply 
for larger ball bearings, so must be manufactured in the plant instru- 
ment shop or purchased as special equipment from gage manufacturers, 































The cross sectional dimensions of the outer ring are so small 
in some cases that the ring can become distorted and an erroneous 
diameter reading can result from measurements unless contact pres- 
sures are kept under two ounces. Amplified electronic or air gage 
methods are frequently used, Instruments for measuring noise, torque 
values and radial and axial play frequently must be made, A high per- 
centage of the employees are supplied with microscopes to aid in manu- 
facturing detail, 


Dimensions must be carefully controlled to comply with, or 
surpass, industry standards as established by the Annular Bearings 
Engineering Committee, to assure proper fits at assembly and to meet 
customer requirements, Some dimensional tolerances are held to 
within a few millionths of an inch, In addition surface finishes of cri- 
tical surfaces such as balls and ball races must be measured and con- 
trolled to within narrow tolerances -- 1/2 microinch in most cases, 


Methods of maintaining small dimensional tolerances and sur- 
face finish requirements are similar to those described in quality con- 
trol literature. The chief differences are the need of careful control 
of gage wear, manipulation of gages and adaptation of refined methods, 
The need of a good metrology section and cooperation with it is of 
course a requirement, 


Miniature ball bearings, like large ones, are composed of 
three essential parts -- an outer ring, an inner ring and a complement 
of balls between the two located in the raceways ground into the two 
rings, Balls of only .015" in diameter are used in some of the bear- 
ings. 


Manufacture begins with special steel rod, about twelve feet 
long, fed into standard screw machines of the smallest type. Inner 
and outer rings are turned with the use of tungsten carbide tools to di- 
mensions approaching the finished requirements in these machines, 


Incoming acceptance sampling is performed on the rods and 
consists of chemical and metallurgical examinations. The samples are 


examined in the laboratory of the Research and Development section 
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which works in close cooperation with MPB's Quality Control Depart- 
ment. At the screw machines, samples are taken by the operators 
and also by the roving inspectors. Performance is examined and con- 
trolled principally by simple frequency diagrams, 


After hardening, all subsequent metal removing operations are 
performed with abrasives by grinding, lapping or honing. During these 
operations, samples are taken by roving inspectors at periodic inter- 
vals and examined for dimensional and visual characteristics, 


The emphasis on examination is simplicity, The ideal condi- 
tion is, of course, to have the operator on each machine do his own 
inspection. The roving inspectors obtain the cooperation of operators 
and supervision by explaining the nature of any defects. 


Easy to read records, cooperation with operators and foremen, 
early summarization of records showing machine performance, de- 
partment performance and overall plant performance are some of the 
steps which make roving inspectors! work effective. 


Major steps in manufacture after the screw machine operations 
are: 


heat treating 

lapping the sides of the rings 

grinding OD and ID of both inner and outer rings 
grinding the ball races in both rings 


At each of the above steps sampling inspection takes place at frequent 
intervals by roving inspectors to make sure that parts are satisfactory 
to pass along to the next operation. Examinations consist of inspections 
for: 


dimensions 

proper hardness after heat treating 

concentricities of ball races to outer and inner dia- 
meters 

parallelism of ball races to sides or faces of inner and 
outer rings and of the sides themselves 

surface finishes of the ball races 


There are many other inspections such as grooves in the outer 
ring to hold thin discs only .003" thick which act as shields on the sides 
of some bearings, proper dimensions of filling slots on full race bear- 
ings used to insert the final ball, and many special requirements. 


Because the parts are so small and of low weight, transporta- 
tion through the plant is no problem, Frequent use is made of an exa- 
mination room, or toll gate, conveniently located, This room is com- 


pletely equipped with fine gages, frequently checked with master gages, 


241 





for the examination of critical dimensions. Product flows through the 
plant in numbered lots, The lots are brought to the toll gate at fre- 
quent intervals where they are subjected to sampling inspection using 
the Mil. Std. 105A single sampling tables for determining conformance, 


Inspection procedures reveal "defects" and "rejections". Def- 
ective parts can be corrected by returning them to the proper factory 
departments. Rejections, of course, cannot be salvaged, 


Miniature Precision Bearings, Inc. make bearings to a single 
high standard of quality known as ABEC-5 grade or better. This pro- 
cedure assures uniformity of product to all customers, 


Having passed the toll gate final inspection, the next step in 
the manufacturing process is the final assembly operation. When parts 
reach this department they should be satisfactory and meet all dimen- 
sional and geometrical requirements. In addition they have been 
thoroughly cleaned, 


To perform satisfactorily miniature bearings must be absolutely 
dirt free. Therefore, a great deal of attention and care is taken to 
maintain the assembly area, the "white room", suitable to assure dirt 
free bearings, The white room is held to closely controlled tempera- 
ture and humidity; it is constructed free from obstructions so it can be 
cleaned easily every day; employees wear lintless clothing; the white 
room area is kept under positive air pressure with entrance through 
air locks, 


The cleaned parts enter the white room through a single window, 
Following assembly there is a final inspection, after which the bear- 
ings are lubricated with special lubricants, tightly packaged and made 
ready for shipment. 


Oftentimes orders are accompanied with special requirements 
which can be met best by selective assembly. A common requirement 
is closer than normal radial play which is usually held to limits of 
. 0002" to .0008" resulting in reasonable clearance for minimum torque 
and medium speed applications. Low torque requirements, both start- 
ing and running, is another example of special specifications. 


A program under the supervision of the Quality Control Depart- 
ment is a continuous torque evaluation plan. Torque, ease of rotation, 
is a property of ball bearings which is desirable to keep low even 
though not often specified in orders, The program is helpful to engin- 
eering and manufacturing in improving a quality characteristic of grow- 
ing importance, 


A vendor evaluation program is also a part of the program. 


Sampling inspection of incoming products entering into the manufacture 
of miniature ball bearings is carried on continuously. Reports to 
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vendors have been mutually helpful with an improvement in vendor 
products resulting. 


Quality Control methods, in the manufacture of tiny products 


manufactured to close dimensions, are readily adaptable and can be 
used as readily as in the manufacture of a larger product, 
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A DESIGNED EXPERIMENT TO EVALUATE SEVEN YARN LUBRICANTS 


Hubert M. Hi21!®! ana Douglas Wheeler! ?! 


Research Laboratories and [bl varn Development Division, 
Tennessee Eastman Co., Division of Eastman Kodak Co., 
Kingsport, Tennessee 


[a] 


INTRODUCTION 


This experiment was designed to evaluate seven yarn lubricants and 
two size formulas. The method of evaluation consisted of preparing warp 
beams with the proper lubricant and size combinations and weaving them 
into cloth. The cloth was then evaluated for warp defects. The par- 
ticular design chosen made possible the use of a number of looms for 
weaving the cloth. It was desired that the experiment show any differ- 
ence present between the lubricants and allow the best lubricant to be 
picked from the group. 


EXPERIMENTAL DESIGN 


The basic design in this experiment was a replicated 2 x 7 facto- 
rial design. In addition, a further complication was present in that 
it was necessary to use several looms in order to finish the weaving in 
a reasonable time. Since the warp beams were approximately 1,500 yd. 
long and would require about 15 days of weaving time, weaving one com- 
plete replicate on a loom would require nearly seven months. Therefore, 
in order to obtain a satisfactory design, it was decided that the block- 
ing would be accomplished by the use of a randomized incomplete block 
design. The combination of factorial designs with randomized incomplete 
block designs was described by Kramer and bradley [1, 2). The particu- 
lar plan used in this study was S-40 [3], which is shown in Table I. 
This plan involves 14 varieties replicated three times in seven blocks 
of six. Kramer and Bradley showed that the 2 x 7 group of the plan can 
be associated with a 2 x 7 factorial, as indicated in Table II. The 
experimental design therefore required that three warp beams be prepared 
from each lubricant-size combination. These were then woven into cloth 
on @ group of seven standard Draper looms, the warps being assigned to 
looms on the basis of the plan shown in Table I. The warps were run on 
the looms in random order. 


COLLECTION AND ANALYSIS OF DATA 


The cloth produced from each warp was cut into approximately 100- 
yd. sections and evaluated by visual inspection. The defects were rated 
on the basis of the scale shown in Table III. The total score and the 
length of each cut were recorded. Data from the first and last cuts 
were not utilized because previous evidence had shown them to be signi- 
ficantly higher in score. 


Table IV shows a typical group of three beams from the same 
lubricant-size combination [LgS;]. It should be noted that not only is 
there a considerable difference in the averages for the thre= warps 
which were woven on different looms but there is a difference between 
cuts in the same warp. 











Because the scoring scheme is essentially a weighting system for 
discrete counts, which were expected to be Poisson in nature, it was 
necessary to transform the data. Experience from previous experiments 
indicated that either the Vx orVx +Vx+l transformation should be used. 
In this experiment it was possible to calculate the mean and variance 

of the values within each warp beam. These values indicated that on the 
average the ratio of the standard deviation to the mean was approximately 
1.0, which would indicate a log transformation. However, the wide range 
[0.34 to 1.77] of the ratios casts some doubt on this transformation. 
Also, the same conclusions were reached by using the log or the Vx trans- 
formation. Therefore, the Vx transformation was used. 


The adjustment of the treatment totals to obtain the treatment ef- 
fects is shown in Table VI. The Tjj values are the totals for the three 
beams for each treatment. The Bij values are the totals for the three 
blocks in which the treatments occur. The t;j values, which are the ad- 
justed treatment effects, were calculated using the following formula: 


u My 


tay - */s Ty + */o1 


n My 


T33 - */1s B,4, - */126 Bi} 
1 Ad 1 


J J 
The other values are self-explanatory. A ready check on the calculations 
is available because <= th = 0. 

From the tij values en analysis of variance is possible using the 
information given in Table VII. The analysis of variance as calculated 
is shown in Table VIII. Although the adjusted treatments do not show 
any significance, the lubricants are significant at the 90% level. 


Returning to Table IV and the distribution of values within a warp, 
it is believed that some of the higher values are caused by some vari- 
able other than those studied in this experiment. If our assumption of 
a Poisson distribution is correct, it should be possible to determine if 
any of the values are excessively high when compared with others in the 
same warp. As a test for high values in a Poisson distribution, we used 
Molina's Tables of Poisson's Exponential Binomial Limit [4]. The calcu- 
lated average for the warp was used to check the values against the cor- 
responding Poisson distribution, and any count which had a probability of 
occurrence of less than 0.001 was excluded. A new average was then cal- 
culated and the remaining data were checked against this new average. 
This test has been used in previous experiments, and in each case except 
the present one, has resulted in reduction of the variance to the value 
expected from a Poisson distribution. 


By means of this test, approximately 18% of the cuts were removed 
as having excessive values. These adjusted data were then analyzed. 
The results of the analysis are shown in Table IX. In this case the lub- 
ricants are significant at the 95% level and closely approach the 97.5% 
level. Once again, neither the sizes nor the. lubricant-size interaction 
are significant. 


_The treatment averages can be calculated from the grand average and 
the tj, values. These averages for the analysis both with and without 
high counts are shown in Table X. The order of the ranking in the two 
procedures is essentially the same and the values appear to separate 
into three groups. However, when the data are compared with the 
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confidence limit of the difference of two means it is apparent that lub- 
ricant 3 is different from lubricants 4 and 7 but is not different from 
lubricants 1, 2, 5, and 6. In the same manner it can be shown that no 
distinction can be made between lubricants 1, 2, 4, 5, 6, and 7. These 
two groups are indicated by the solid vertical lines. This conclusion 
is substantiated by Duncan's Multiple Range Test [5]. Removal of the 
high values in the data resulted in an over-all higher significance 
level without changing the ranking significantly. 


One other analysis of importance which was conducted consisted of 
calculating the percentage of fabric that was removed because of high 
scores. These data were transformed as the arc sinVx. The results of 
an analysis of these data are shown in Table XI. In this instance the 
lubricants show no indication of effect, whereas the sizes exceed the 
90% level. The lack of any evidence of a relation between the lubri- 
cants and the occurrence of high score on cuts within warps indicates 
that the procedure for exclusion of high values is satisfactory. 


CONCLUSIONS 


The results of this experiment indicate that lubricant 3 is 
superior in performance to lubricants 4 and 7, but cannot be distin- 
guished from lubricants 1, 2, 5, and 6. Lubricants 1, 2, 4, 5, 6, and 7 
also form a group in which no differences in performance can be demon- 
strated. There is some evidence that the amount of excessively high 
scores is related to the size which was used on the warp. 
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Table I 
Incomplete Block Design S-40 
velb, r=3,k=6, bd=7, m= 7, n= 2,A1 = 3,An=1 











Groups Plan 

1 8 1 8 2 9 \ 11 
2 9 2 9 3 10 5 12 
3 10 3 10 a. 6 13 
4 11 2’ 2 5 12 7 14 
5 12 5 12 6 13 1 8 
6 13 6 13 aa 2 9 
7 14 7, 1 8 3 10 

Table II 
Association Scheme 

LiS; [1] LiSe2 [8] 

LoS, [2] L2Se2 [9] 

IeS1 [3] IgSe [10] 
14S. [4] L4Se [11] 
IsS1 [5] IsSe [12] 
IeSi [6] LeSe [13] 
L7S1 [7] L7So [14] 


Table III 
Scoring System for Defects 
Points 
——y—— 
Major defect 4 
Half major 4 
Minor defect 2 
Half minor 1 


248 











Warp No. 
ow 

2 

3 

k 

5 

6 

7 

8 


9 
10 
11 


Total 


Average per 
100 yd. 


Adjusted average 
per 100 yd. 


Table IV 
Scores on Treatment LeSi 





2k 25 26 
Ya. Score Yd. Score Yad. Score 
130 re) 130 e) 90 12 [a] 
102 2 « - 90 88 
142 0 145 0 93 8 
108 4 109 2 89 8b] 
110 14 119 0 87 4O 
173 2615) 80 2 70 8 
86 18) 4] 80 3 108 8 
91 ko 136 Ore] 73 Tra] 
91 8a] 100 8 93 Ora] 
90 72 106 0 89 98 
1,123 186 1,005 15 882 541 
16.56 1.49 38.66 
6.30 0.77 9.75 


[a] Removed on first test. 
[b] Removed on second test. 
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Table VII 
Intra-Block Analysis of Variance 























Source of Degrees of Sum of 
Variation Freedom Squares 
Treatments 13 3z=z tt - */5t tf, 
[adjusted] 
Lubricants 6 14/.2 FT 
[adjusted] , 
Sizes 1 212 t4 
[adjusted] 
Lubricants x sizes 6 by subbraction 
[adjusted] 
2 
Blocks 6 4.3 B - — 
[unadjusted ] 
Error 22 by subtraction 
2 
2 GT 
Total 41 zzz Ti 4s -= 
Table VIII 


Analysis of Variance 
[All data included] 























Source of Degrees of Sum of f fal 
Variation Freedom Squares Mean Square 
Treatments 13 25.08 1.93 
[adjusted] 
Lubricants 6 20.66 54h 
[adjusted] 
Sizes 1 0.01 0.01 
[adjusted] 
Lubricants x sizes 6 47 0. 74 
[adjusted] 
Blocks 6 59.86 - 
[unadjusted] 
Error 22 35.92 1.63 
Total 1 120.86 


[a] Significant value at a = 104 — 3.36 
Significant value ata = 5% — 4.16 
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Table IX 
Analysis of Variance 
[High cuts removed] 























Sources of Degrees of Sum of fal 
Variation Freedom Squares Mean Square ° 
Treatments 13 20.55 1.58 
[adjusted] 
Lubricants 6 14.81 2.47 
[adjusted] 
Sizes 1 0.47 0.47 
[adjusted] 
Lubricant x size 6 5.27 0.88 
[adjusted] 
Blocks 6 42.68 - 
[Unadjusted] 
Error 22 18.05 0.82 
Total 41 81.28 
[a] Significant value at a = 54 — 2.09 


Significant value at a = 2.5%4— 2.50 


















































Table X 
Ranking of Lubricants 
All Data High Scores Removed 
Adjusted Warp Adjusted Warp 
Adjusted — Average Adjusted — Average 
Average [Score/100 Yd. } Average [Score/100 Yd. ] 

Ig 2.17 4.71 Is 1.69 2.86 
Lz 3.27 10.69 Les 2.60 6.76 
Li 3.32 11.02 Ig 2.64 6.97 
Ls 3.40 11.56 Li 2.69 7.78 
le 3.64 13.25 Le 2.81 7-90 
Tae 4.47 19.98 Lr 3.61 13.03 
Lr 4.79 22.9h ly 3.93 15.44 
[a] Variance [a, - ay] = 0.70 {b] Variance [a, - ay] = 0.35 

98% Confidence limit of 98% Confidence limit of 

difference = 2.10 difference = 1.48 
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Table XI 
Analysis of Variance 
[Percent of fabric removed due to excessive scores] 




















Source of Degrees of Sum of fal 
Variation Freedom Squares Mean Square 
Treatments 13 0.606 0.047 
[adjusted] 
Lubricants 6 0.195 0.033 
[adjusted] 
Sizes 1 0.124 0.124 
[adjusted] 
Lubricant x size 6 0.287 0.048 
[adjusted] 
Blocks 6 0.554 * 
[unadjusted] 
Error 22 0.841 0.038 
Total ki 2.001 
[a] Significant value at a = 104— 0.112 
Significant value ata = 5%— 0.163 














STATISTICAL CONTROL OF A SPINNING LABORATORY 
Philip R. Ewald 


U.S.D.A. Spinning Laboratory, Knoxville, Tenn.* 


Introduction 


A spinning laboratory is essentially a yarn mill in minature with 
most of the mill's attendent control problems. When the mill handles 
cottons of all types, as is the case for Knoxville Spinning Laboratory, 
there are introduced into the control problems variables of a nature 
which require the application of the science of statistics. (1) When 
the mill is operated as a laboratory, its instrumentation becomes very 
important; measurements must be carefully made to insure meaningful 
control. In a spinning laboratory then, we are immediately required 
to develop an adequate statistical instrumentation, (2) 


Fiber Analysis 


The first requirement in the processing of cotton yarns is an 
analysis of the raw material, the cotton fibers. Nature has intro- 
duced such variability into the fiber properties of cctton that in the 
measurement of these properties, the science of statistics becomes 
every bit as important as the science of instrumentation. Measurement 
problems resolve essentially into the job of determining mean values 
and just how good the means are. There are of course two approaches 
to the problem, first, homogenize the population to the point where 
only a few measurements are needed, or second, take enough measurements 
to establish a mean on the population as is. Either procedure could be 
expensive if carried to an extreme. As it turns out, in normal 
practice, a combination of the two methods yields the optimum results. 


To establish a reference standard, extreme homogenization was 
carried out on a batch of well known cotton by effectively doubling 
some 392,000 times in the mixing process. Then, to establish the 
statistical accuracy of the instruments themselves on the homogenous 
cotton, replicated measurements were made on 100 samples. The results 
are given in Table 1. The fineness and immaturity, upper half mean 
and mean length, and strength and elongation are measured respectively 
on the Arealometer, Fibrograph and Stelometer developed at the Univ- 
ersity of Tennessee by Dr. K. L. Hertel and co-workers. Generally, 
where applicable, the measurement techniques follow the ASTM standard 
methods. (3) The data in Table 1 is the basis for establishing a 
control chart for the laboratory, an example of which is shown in 
Figure 1. Check measurements are made on the standard once each day 
at a random hour and if the meesurement is over the control limits of 
2 standard deviations away from the mean, there is an alert. If the 
descrepancy is repeated, an investigation is made. As a safety 
measure for those cases where there is a completely unexplained drift 
outside the control limits, and such cases have happened, the laboratory 





* A unit of the Agricultural Research Service, Crops Research Division, 
Cotton and Cordage Fibers Research Branch; cooperating with the 
University of Tennessee Agricultural Exmeriment Station. 
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uses a correction factor based on the average of the past 10 day's 
measurements on the standard. Such a procedure of course helps to 
correct for instrumental drift even though it is within the control 
limits and no corrective measures are required. 
Table 1 
Standard K-55 Cotton 
Fiber Analysis at 70°F. and 65% R.H. 


Values for 100 determinations on completely blended sample. 


Mean Std. Dev. CV% Normal No. Determinations 
A Fineness Lo 4.8 me 2 
D Immaturity 26.6 3.6 13.5 2 
UHM Length 1.21 202 1.7 2 
M Length 1.03 203 2.9 2 
Tj Strength 1.88 06 3.2 4 
Bj Elongation 8.8) 38 4.3 4 


As for the routine measurement of fiber properties, an approxima- 
tion to a homogenized population is obtained by taking 10-15 pinches 
from the sample under test to give a 5-10 gram sub sample. About 0.5 
gram of this sub sample ts resampled onto the Fibrograph combs and then 
homogenized. After length measurements are made, about 0.15 gram 
samples are taken from the combs for the Arealometer determinations and 
0.02 gram samples for the Stelometer. These very small sub samples 
still contain many hundreds of cotton fibers. The problem next is to 
determine how many replications are needed in the measurements. Ex- 
perience has shown that two replications suffice for the length and 
fineness measurements and four are needed for the strength and elonga- 
tion. Referring back to Table 1, it is seen that this need follows 
from the higher coefficients cof variation shown by the strength and 
elongation measurements. The outstanding exception with an abnormally 
high CV, the immaturity value D, is a calculated value based on the 
difference of two large numbers which are A values at different 
pressures, and is accepted for its worth without going to considerable 


trouble to bring it into line. 


With the fiber analyses completed, the data is either referred 
back to cotton breeders or carried on into yarn processing, depending 
on the job at hand. 


Spinning Process 


The fiber analyses are used to set the various operations of the 
spinning process to optimum efficiency. For example UHM length, and 
to an extent fineness, governs the speed and settings of the card. 
Combing is dependent on UHM and M lengths and their ratio, and to an 
extent on fineness. Drawing, roving and spinning are all critical of 
UHM length. Roving and spinning are also dependent on fineness. For 
correlation work, fineness and immaturity are related to neppiness and 
fiber strength to yarn strength and to some features of processing. 
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In processing cotton fibers into yarns, it is necessary to keep 
the throughput or weight per unit length of material under process 
a constant in order to maintain control. Figure 2 illustrates a 
complete mill process of which the Spinning Laboratory Test is an 
abbreviated version indicated by the solid lines. Note that feedback 































































































oa os + 4 * * 
—>— FEED |-JoPEN L] LAP }ICARD LiCOMB 14 DRAW 
=" 
Les Le 
* ] 
r?P--* 
' ' 
' ! 
one ae ‘ peony ‘ree © saee. 
WEAVE Lt SIZE |--iWARP'--\WIND tLISPIN LLIROVE 
L-.--J L---_ bh ie Beye 


Y Y 
FIGURE 2 


is used in each step to keep the process under control. In nearly 
every case, the feedback consists of the elementary procedure of taking 
a sample length of material, weighing, and then adjusting the machine 
if necessary. The effectiveness of this simple control is due to 
several factors: careful design and stability of the machines, frequent 
doublings to eliminate random irregularities, and relatively large 
samples for each measurement. For example, in the spinning test, the 
samples at the successive stages represent the following percentages: 
Feed- 100%, card- 100%, draw frame- 11%, roving frame- 25%, spinning 
frame- 55%. Obviously, where the sample is 100% of the population, 

the process is under better control than in the cases where the sample 
is as low as 11%. An improvement can be made if the sampling is done 
continuously, on all the material, and in a manner which will not dis- 
turb the process. Such a sampling can tell much about the short term 
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variation in the material under process, a factor which is lost in large 
group samples. Where the operation has been starred in Figure 2, auto- 
matic feedback control is already being used in the mills or is being 
studied. 


To make continuous type analyses and study the processing varia- 
bility, uniformity analyzers have been developed for recording the cross 
sectional weight of yarns, rovings and slivers. Two approaches to the 
analysis have been developed. The first type records the weight per 
unit length of a fixed length of material and gives the mean and 
variance values for the length. The second type gives a record of 
weight per unit length and the mean and variance values on a continual 
basis. It should be noted that the first type will give the same mean 
and variance values regardless of which direction the material is run, 
whereas in the second type, the values are affected by previous read- 
ings in a continually diminishing degree, depending on the time constant 
in use. However, it would appear that either method gives about the 
same amount of information in that the fixed length analysis is just as 
likely to give different readings for different lengths as is the con- 
tinual analysis to be affected by past history. A point in favor of 
the continual analysis is that it may be applied to a process for feed- 
back control. 


An analyzer of the second tyne, using beta radiation which permits 
100% sampling on a continuous basis without disturbing the process has 
been developed at the University of Tennessee. (l) The statistical 
analysis is attained as shown in the block diagram of Figure 3 which, 
it will be recognized, contains an adaptation of the machine method for 
obtaining variance to the continual process, Analysis of this type is 
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complete as far as variability. is concerned if the irregularities are 
normally distributed. In this case, the mean and variance or lst and 
2nd moments completely describe the variability. However, processing 
might well introduce 3rd and th moment distributions through non- 
linearities or instability so that it may well be necessary to start 
investigating the automatic analysis of these moments. Periodic 
variability is already being automatically analyzed for fixed lengths 
of material by an instrument now on the market. 


Yarn Testing 


In conclusion, the final step of the Spinning Laboratory Test is 
the evaluation of the strength of the yarn produced. As might be 
suspected, the more yarn that is broken, the better is the average 
value obtained. This may be illustrated by studying the variability 
of skein and single strand yarn breaking methods used in the Spinning 
Laboratory. 


The standard method for determining yarn strength at break is to 
wind an 80 turn skein using 120 yards ver skein which, with 20 breaks 
per test, gives a total of 2400 yards used. This method has a CV of 
about 3%, In the Spinning Laboratory, a shorter length of about 22.2 
yards for a 40 turn skein with 30 breaks gives a total of 666 yards 
per test. The CV for this case is about 4.5%. The square root re- 
duction in yardage of yarn used is 2 to 1, but the CV has increased 
only 1 to 1.5, so it may be said that the smaller skeins are, statisti- 
cally speaking, more efficient. Further reduction of the skein length 
to 11.1 yards and doubling the turns to 80 at the same time reduced 
the CV to about 3.5%, still further improving the efficiency, but the 
practical problems of handling such skeins have obviated their use for 
the present. 


More information is obtained if the yarn is broken single strand, 
since a direct indication of yarn strength without the complications 
of frictional effects is attained and the determination of yarn 
elongation is likewise obtained. However, much less yarn is broken 
per machine operation. For example, if 100 breaks of 20 inch gage 
length yarn were made, only 55 yards have been tested, and the result- 
ing CV of 12%, as found by experiment, is about what might be expected, 
The above data was all taken with tests on 22's yarn. It might be 
expected that the statistical efficiency per machine operation could 
be improved by using heavier weight yarns, Tests have shown this to 
be true, and the work of the Laboratory is proceeding in that direction, 


In order to make single strand breaking feasible, it is necessary 
to make many breaks and consequently, to employ automatic instrumenta- 
tion. In the Spinning Laboratory, much research work is done on the 
well known Model IP-l single strand breaker. An automatic indicat on 
of the average of a numberof breaks was attained by rigging one 
counter to a gridwork and a break indicator to totalize the plane table 
travel and the other counter to totalize the number of breaks. For 
routine testing, an even more elegant system is used in a commercial 
instrument which gives the average of breaks and elongation and also 
frequency distribution of the breaks, 


Overall control of the Laboratory is attained by frequert com- 
parison of the yarn properties with the fiber properties of the cottons 
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under test and by occasionally running a standard cotton through the 
entire process. 


Conclusion 
In a spinning Laboratory, which is a miniature version of a 
staple yarn processing plant complete with a fiber analysis set up, 


it is observed that the science of statistics, coupled with an adequate 
instrumentation, can result in accurate control of yarn manufacture. 
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LIGHTING FOR INSPECTION 


Willard Allphin 
Sylvania Electric Products Inc. 


Note - This paper was prepared primarily for oral 
presentation, using slides and shadow-box demon- 
strations. The slides are reproduced here as 
photographic illustrations, but the demonstrations 
can only be described briefly in parenthetical 
remarks. 


A blind man could work as an inspector if his job were to try 
pieces in a go-no-go gauge. However, most inspection tasks involve 
seeing, and are therefore wholly dependent upon illumination. 


I shall not attempt to give complete prescriptions for lighting 
all inspection tasks, but shall attempt to remind you of the prin- 
ciples involved and to give some examples of their application. 


Lighting engineers like to divide illumination into quantity 
and quality considerations, and this will be useful also in discuss- 
ing inspection work. We can further divide quality into two categories. 
Objective quality, including the direction of light, its color and the 
amount of diffusion has an important bearing on the work. So does 
subjective quality, involving such reactions as "good or bad," 
"pleasant or unpleasant," "cheerful or depressing." All can affect 
the speed and accuracy with which inspection tasks are performed. 


Let us first consider quantity of illumination, or more properly, 
the brightness of the task. The illumination in footcandles is only 
a means to an end. The important thing is the brightness of the task, 
since we see things only by the light they emit or reflect. 
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Fige 1 Acuity increases with the brightness of the 
task as long as its surround is bright. 
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Figure 1, reproduced from the Dlluminating Engineering Society 
Handbook, is typical of how a visual skill can be affected by the 
brightness of the task. This is for visual acuity, the ability to 
see small detail, but other visual skills, such as speed of vision, 
depth perception and contrast sensitivity behave in the same way. 


It is true that the dotted curves turn downward after particular 
brightness levels are reached, but these curves represent cases where 
the surrounding area was considerably darker than the task. Thus, as 
the brightness of the task increased, a point was reached where the 
task itself became glaring enough to interfere with vision. However, 
glare is a relative thing; and when the surround was bright and in- 
creased in brightness with the task, the solid curve shows the result. 
Acuity continues to increase with brightness even up to sunlight levels 
of illumination. 


Here is a practical demonstration of what I have been saying. 
In order to reach brightness which will be effective at a considerable 
distance, I have lighted the task by transmitted light instead of re- 
flected light, but the results could be shown with reflected light if 
you were close to the task, as you would be in most applications. 


At this brightness level I am sure there is a limit to how much 
detail you can see in the target. (Shadow box with small lighted 
circular target area in dark surround.) When I increase its bright- 
ness you will be able to see more detail and see it more easily. 
(Target brighter. Surround still dark.) Now if I increase the bright- 
ness to this level, it is so high as to be glaring; and you can see 
less. (Target very bright. Surround dark.) 


However, here is the important effect of relative brightness. 
When I increase the brightness of the surround, you can see better 
than ever, yet I have not changed the brightness of the target. 
(Target as before. Surround bright.) 


Applying this principle to industrial tasks, we should furnish 
enough light to make the task easily discernible, and make the sur- 
round bright enough so that it will not contrast greatly with the 
task. 


Beyond the immediate surround of the task, quality considerations 
apply both objectively and subjectively. Fixtures should not produce 
glare which will make it difficult to see. Ceilings, floors and other 
significant areas should not be so dark in tone as to cause the eyes 
to re-adapt every time they look away from the work. 


Not precisely measurable, but none the less important, is the 
"pleasantness" of the environment as affected by the lighting. Color 
here plays an important part and is, in turn, affected by the lighting. 


Direction of light is another important factor in revealing de- 
tail, particularly in inspection work. Suppose the inspector is look- 
ing for scratches on a surface. With diffused illumination, the 
surface may look like Figure 2. 
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Fige 2 Scratched surface illuminated with diffused 
light. 


But when we throw the light in a particular direction, from a glanc- 
ing angle, the surface may look like Figure 3. 











Fige 3 Same surface as in Fig. 2, illuminated with 
"sharp" light from glancing angle. 
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An extreme case of changing the direction of light occurs when we can 
see a particular task better in silhouette. This may be done by plac- 
ing the object in front of a lighted area or, in the case of small 
parts, an optical comparator may be used. 


Figure ha shows the mount structure of a miniature electronic 
tube, and Figure kb shows the central portion turned 90° and en- 
larged further, The wire is considerably smaller in diameter than a 
human hair, 



























































Fig. ha Fig. kb 


Mount structure and grid portion from miniature 
electronic tube. 



















Another type of backiighting is used when cloth is moved across a 
"perch,* as in Figure 5, where the light passes through the product, 
revealing faults in weaving. 





Fig. 5 Inspecting cloth on "perches" by tranamitted 
light. 
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Edge lighting will often show flaws in glassware, as in Figure 6. 


Fig. 6 Inspecting glassware by edge lighting, 
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A special case is where the material being inspected has a surface 
which acts like a mirror, If the problem is to see waviness of the 

surface, this can be shown by reflecting in the surface an image of an 
illuminated grid, as in Figure 7. The method can be used for inspect- 
ing such things as sheet mica, as well as the more obvious shiny sheet 
metal. 








Fige 7 Inspecting a shiny metal surface by viewing 
a reflected pattern, 
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Sometimes reflections must be reduced in order to see details 
under the gloss of a surface. In such cases polarized light may be 
the answer. 


Some substances have a natural tendency to fluoresce under black- 
light, that is, ultraviolet radiation just beyond the visible range. 
If surface flaws cannot be seen properly under visible light, they may 
reveal themselves under blacklight. A patented variation of this is 
where a casting is exposed to blacklight after having been dipped in 
a fluorescent penetrant. Cracks which would otherwise be invisible 
will show up under these circumstances. Figure 8 shows a flawed cast- 
ing under normal light and under blacklight, after having been in- 
mersed in the penetrant. 


Fig. 8 A flawed casting under normal lighting and 
under blacklight after having been treated 
with a fluorescent penetrant. 

(Photos courtesy Magnaflux Corp.) 


270 








Except for blacklight, we have as yet said nothing about the 
color of the light source. In many cases this is unimportant as long 
as the color is reasonably "natural" in appearance, However, there 
are times when color is very important. For example, these two cloth 
samples look much alike, (cloth samples in shadow box) but when I 
change the color of illumination they appear to be quite different. 
When two samples behave in this manner they are called a metameric 
pair. 


Sometimes when a complete match is needed between two samples it 
is necessary to view them under different illuminants in order to make 
sure that they are not metamers. (Samples which look alike under both 
colors of illumination.) 


Sometimes specifications call for samples to be examined under 
“daylight,” but even where further specified as "north sky daylight" 
the color will change with time of day and weather conditions, Much 
greater uniformity is obtained fram combinations of fluorescent and 
incandescent in special "daylight" fixtures. Many times the addi- 
tional red wavelengths from the incandescent lamps are not really 
needed. It is more important to have a standardized color of illumi- 
nation than it is to have a color which exactly matches some particular 
lighting condition in nature. The human eye is a very adaptable in- 
strument and can readily become used to working under any one of 
several “white" light sources as long as all of the visible wavelengths 
are present and the unbalance is not extreme. 


Sometimes the balance is purposely upset. For example, a photo- 
engraver finds it difficult to inspect the yellow proof fram a color 
printing plate, because of the paleness of the impression. (Yellow 
proof in shadow box under white light.) 


Illuminated by blue light, the complementary color to yellow, the 
ink shows up as dark against the white background of the paper, and 
imperfections in the plate are more readily seen, (Proof illuminated 
by blue light.) 


Inspection lighting should provide more than the bare opportunity 
to see the required detail; it should enable the inspector to see as 
easily and comfortably as possible. It is not possible as yet to 
scientifically measure fatigue, but the common sense viewpoint would 
certainly indicate that fatigue should be less at the end of the day 
when the lighting has been as revealing and as pleasant as possible, 


There are many types of inspection problems which have not been 
mentioned here, but each should be met by a joint effort between the 
quality control engineer and the lighting engineer, the quality man 
to define what must be seen and the lighting man to recanmend the means, 


271 





272 











THE USE OF KEY SORT CARDS FOR COLLECTING AND 
ANALYZING DATA IN THE MANUFACTURE OF LARGE WIRED EQUIPMENT 


George P. Lewett 
Western Electric Co., Inc. 


Introduction 


This article reports on the use of key sort cards for the collec- 
tion and evaluation of quality data at the Western Electric Company's 
Kearny, New Jersey Works; the report is supplemented by an example of a 
control application and a section dealing with the statistical analysis 
involved, While the Western Electric Company has, for many years, pio- 
neered the development and application of quality control techniques, 
further work is being carried out continually. The particular illus- 
tration used deals with a type of telephone equipment designated as 
wired equipment, in which major operations are the assembly of com- 
ponents and their interconnection by the use of a large number of wires, 
The control problem centers about the quality of the soldering performed. 


Effective control of product quality depends to a great extent on 
proper analysis of data. In general, the nature of large wired equip- 
ment and the manufacturing processes peculiar to it make analysis par- 
ticularly difficult. Information on quality obtained through inspection 
and test is difficult to handle and evaluate due to the variety in types 
of defects and the many stages involved in manufacture. As a result 
therefore, if proper facilities are not provided, the information will 
be transformed into an overall measure of shop performance; little use 
will be made of dev ils for the specific purpose of isolating and cor- 
recting sources of vrouble. The Key Sort Card System described in this 
article illustrates the type of facility required to permit better use 
of the details. 


The Manufacturing Process 


The process selected for initial application of the Key Sort Card 
System is concerned with the assembly and wiring of a telephone switch- 
board. The switchboard is one in fairly common use, relatively small in 
size (for this class of equipment) and of rather simple circuitry. How- 
ever its physical characteristics raise certain manufacturing and quality 
problems which are identical in nature to larger wired equipment. It 
contains approximately 1000 soldered connections which are made in two 
separate wiring stages due to the compact nature of the wiring. Wiring 
cycles in the two stages are fairly long so that the production from any 
particular operator is numerically small in terms of units per day or per 
week, Since it is required to hold insecure connections to a level of 1 
in 10,000, a 100% visual inspection of soldering and wiring is made after 
each wiring stage followed by operational test and a final inspection of 
apparatus mounting, assembly and stamping. For the most part, soldering 
defects result from operator error, although defective solder and de- 
fective soldering irons are contributing factors. Inspectors perform a 
complete inspection of each connection on the unit and are instructed to 
report to the shop for repair such defects as excess or insufficient 
solder, insecure connections, broken wires, damaged insulation, etc. It 
is recognized that differences are to be expected between inspectors in 
their judgment as to what constitutes a defect and their effectiveness 
in detecting defects. Normal methods of controlling these variables 
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Figure 1. Key Sort card used for collecting and 
analyzing data on quality (hypothetical 


data). 














are in use but judicious use of the card system can be expected to pro- 
vide additional means of control. 


The Key Sort Card 


The card illustrated in figure 1 was designed to tie in directly to 
the process. The numbered perforations for the most part refer to oper- 
ators at each of the several stages in the process. The card is attached 
to the switchboard in the assembly stage and the perforations are marked 
with a pencil by the various operators, inspectors and testers as the 
board moves through the manufacturing process. After final inspection 
the card is removed and punched; the punched perforations then will indi- 
cate the route which the particular switchboard followed -- for example: 
a 507B switchboard, assembled in the first three stages by operators #1, 
3 and 7 respectively, stranped by operator #16, inspected by inspector 
A, etc. The inspectors and testers record their results directly om the 
face and on the reverse side of the card. The sorting feature is ap- 
plied by stacking cards from completed switchboards, passing a sorting 
needle through a particular perforation and lifting the stack; cards 
punched in that particular perforation will then separate from the main 
deck. The holes at the lower right, marked "Defects", are to be used on 
a "special study" basis to give information on particularly troublesome 
items; those at the top left are used for general control purposes. 


and the Need f ta e 


It is an accepted precept of quality control that quality should be 
built into a product rather than achieved by screening and reworking. 
Inspection operates according to this principle in two ways: (1) by 
rejecting poor product for reworking at shop expense, it provides in- 
centive to do the job right the first time and (2) by feeding information 
back to the making shop it enables the shop to eliminate or control 
sources of error. In the manufacture of the equipment under study 
requirements for 100% inspection and the difficulty of processing defect 
data have combined to hinder the achievement of these two objectives. 
For 100% inspection to operate effectively in accepting product, two sets 
of criteria are required: (1) criteria for either accepting the indi- 
vidual product or returning it as a reject to the shop for repair and 
(2) criteria for accepting or rejecting (halting) the flow of product 
for sorting and verification. Where wired equipment is concerned however 
due to the wide variety in types of defects, variations between in- 
spectors, low output of completed units operator (due to the large 
number of connections required per unit), etc., it becomes extremely 
difficult first of all to establish criteria of the second type and 
secondly to set up means for rapidly comparing operating quality against 
these criteria. The second purpose of Inspection -- to feed back infor- 
mation -- is also frustrated by the difficulty of processing data so as 
to reveal sources of defects. The end result is frequently degeneration 
of inspection to a sorting operation with Inspection becoming responsible 


The Key Sort card is a facility designed primarily to remedy to some 
degree the difficulties involves in processing data and was stimlated 
partly by the thoughts expressed in the preceding paragraph. Aside from 
the sorting feature, the durable construction of the card and the fact 
that each card gives a complete quality and production history of a spe- 
cific switchboard are decided advantages over conventional methods. As 
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a trouble-shcoting facility however the card evokes the most interest. 
Trouble-shooting in processes of the type under discussion is basically 
a matter of comparing two or more equivalent defect sources and de- 
termining whether a particular source is significantly different in the 
frequency with which it emits or causes defects. In order to make valid 
comparisons however particular consideration must be given to the bases 
for comparison. In equipment precesses this will entail taking into 
consideration, when comparing for instance the performance of two oper- 
ators at a particular stage, such factors as type of defect, period of 
time, tester or inspector identity and operators in preceding or subse- 
quent stages. Due to the numerically low output per operator and the 
number of levels so to speak of each of the factors listed, this would 
normally involve handling large quantities of data. The Key Sort card 
is particularly suited for culling out that data which is pertinent to 
the problem for on-the-spot diagnosis or for the establishment of pre- 
liminary hypotheses. An example of this function is given in a subse- 
quent section. 


As a matter of practical application, the completed cards are 
received initially by the shop supervisor. He is therefore provided 
with a tool, not heretofore available, for more effectively utilising 
the information provided by inspectors and testers. Application of the 
sorting feature enables the supervisor to maintain a continual check on 
individual operator performance, identify and rapidly run down reasons 
for major deviations from standard, verify the accuracy and consistency 
of inspectors, etc. 


An_ Example 


Soldering and wiring quality, as measured by the visual inspectors, 
had been charted over a period of two months in terms of defects per five 
switchboards (figure 2). The pattern shown however was not completely 
satisfactory and it was decided to study the results in greater detail. 
The cards containing the charted results were sorted into all possible 
combinations of operators and inspectors and each combination was 
arranged in order of wiring date. Particular combinations were selected 





Figure 2. Overall soldering and wiring quality 
from visual inspection results. 
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for analysis due to the frequency with which they occurred and their 
spacing in the time period considered. For each combination selected 
five successive switchboards were taken starting with February 6 (Phase 
I) and the results recorded as figure 3A; results on the last five 
switchboards (Phase II) in each combination were recorded as figure 3B. 
The data was then analyzed for the purpose of establishing preliminary 
hypotheses as to probable causes of variation; it was also of interest 
to determine whether a shift in product quality has occurred and how 
operator and inspector performance were affecting presentation of group 
results, 


In this example, since analysis was primarily of an exploratory 
nature, the control chart approach was felt to be particularly suitable. 
As illustrated in figure 4, the two levels for each factor were first 
compared in each of the two phases, In these comparisons use was made 
of the recent report* that the difference between two Poisson variables 
is distributed with mean (m) - m2) and variance (m) + m5). Hypothe- 
sizing no difference between levels, m = m, = mo, the expected differ- 
ence is zero and the standard deviation of the difference is = Sq. 
For purpose of this analysis 3 s, limits were considered significant 
although probability levels derived from the exact distribution function 
were not derived. It can be stated however from consideration of the 
a inequality that the significance level is at least better 
than 11%. 


For Phase I therefore a center line was drawn at the average for 
samples of 20 (Cap = 38 = 19) and limits draw at 


2 


Defects found per 20 switchboards were then plotted for the two 
levels of each factor -- two points equidistant from the center line 
C20. A significant difference is indicated if both points are outside 
of limits (a difference of 3 83). The effect of time was then studied 
in a similar manner, visually comparing the difference from Phase I to 
Phase II for each level of the three factors. 


The results of the study indicate in brief that there is a strong 
likelihood of difference between the two inspectors and between the two 
operators at the Strap Wiring Stage. The general improvement in quality 
indicated however would seem to be primarily a result of the strong 
improvement in the performance of operator #4 at the Surface Wiring 
Stage. 


* See reference 1; the frequency function for the difference 
between two Poisson variables is given as 


-(m, + n>) x/2 
P (x) =e - (m)/n2) ole (2 1] n>) 


where Ixy (t) is the modified Bessel function of order x and 
argument t. 


277 














SURFACE WIRING OPERATORS 















































2 wh 
INSPECTORS INSPECTORS 
A B A B 
1 1 n 4 13 
STRAP _ 
WIRING 
OPERATORS 
2 1 2 0 6 
-15- =23- 
C20 
A-63 B= 32 
3B. 
PHASE II 
3 4 
A B A B 
1 3 9 0 3 
n= 
8 
2 0 0 0 4 
> = 
C25 


Figure 3. 








A-3;B-16 


Data indfects per five switchboards 


20 


20 


15 


19=C 











Figure 4. 





alysis of data 


279 








Concluding Remarks 


Statistical Quality Control is frequently thought of as a collec- 
tion of specific techniques which have proved to be particularly 
effective in maintaining product quality at an acceptable level. This 
concept, emphasizing techniques rather than principles, can be parti- 
cularly damaging to the cause of Quality Control in some areas of manu- 
facturing. It should be recognized that a particular technique -- such 
as continuous sampling or use of a standard X, R chart -- is only appli- 
cable under certain specific conditions and in general, it is con- 
ceivable that control may be achieved solely through adherence to basic 
principles. Of these principles, in this sense the most important is 
the rigorous use of information to control quality at the source. In 
manufacturing processes, such as that under discussion in this article, 
adherence to this principle requires development of better methods of 
collecting and handling information for rapid analysis and transmission 
to the making shop. The Key Sort card is one such method, of limited 
application of course, but indicative of the approach needed to 
effectively install a system for controlling quality in processes of 
this type. A particular advantage of the Key Sort card is that its 
durability permits it to travel with the equipment in process and its 
simplicity engenders rapid evaluation of results for shop control pur- 
poses. In addition, it is a permanent record which lends itself to more 
leisurely statistical evaluation without the need for transposing the 
data. My remarks have, I hope, illustrated both of these applications. 
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THE PROBLEM OF CERTIFICATION THAT MATERIALS 
MEET STATISTICAL SPECIFICATIONS 


R- D. Smith 
Union Carbide Nuclear Company 


For a number of years the Y-12 Plant of Union Carbide Nuclear 
Company in Oak Ridge, Tennessee has been confronted with a materials 
quality assurance and certification problem which is, if not unique, 
at least unusual. Our customers have presented us with this problem 
in the form of specifications for the chemical constituencies of our 
products. These specifications are written on a statistical basis. 
In the absence of specific references to similar problems in the 
quality control literature we have devised several methods for cer- 
tifying that our products meet these specifications. Since we feel 
that specifications of this type will probably become more common in 
the future, we trust that the following brief description of our methods 
will have some value for others. 


To exemplify the type of specifications to which I refer let me 
invent a chemical compound, say "unium carbide", which, we will say, 
my plant is committed to produce in considerable quantity over an 
extended period. For such a compound we might be provided the fol- 
lowing specifications: 


Chemical Specifications 





1. This material shall be unium carbide. 


2. Chemical analyses for unium, carbon and impurities listed 
below shall be performed on as many samples as necessary 
to fulfill the requirements of the following paragraphs. 


3. Using standard statistical techniques, the manufacturer 
shall be able to demonstrate: 


a. With 95.0% confidence that 97.5% of the material 
delivered contains at least 0.780 grams of unium 
per gram of material. 


b. With 95.0% confidence that 97.5% of the material 
delivered cortains at least 0.190 grams of carbon 
per gram of material. 


c. With 95.0% confidence that 97.5% of the material 
delivered contains no more than 500 parts per mil- 
lion by weight of iron or sulfur. 


4, The manufacturer shall submit querterly reports giving the 
results of the chemical measurements and the statistical 
calculations which demonstrate that the conditions of para- 
graph 3 have been satisfied. 





Aside from a certain ambiguity in the phrase "standard statistical 
techniques" the specification at first glance seems to be easily follow- 
ed assuming, of course, that the product is truly acceptable. Serious 
difficulties can arise, however, in demonstrating statistically that 
these specifications have been met even though the product is good. 


Basically, we have used four distinct statistical techniques for 
this purpose. The first and simplest of these we call the total vari- 
ability method. Suppose we consider the impurity sulfur for which the 
specification set an upper limit of 500 parts per million in our imag- 
inary compound. Let us say that our analyses for sulfur average 50 
parts per million and that these analyses have a variance of 2500 parts 
per million squared. Then “eS ) of the standard deviation in conjunction 
with the tolerance factor, K 1) » will provide a certification statement 
which is satisfactory. It should be noted that the variance of the 
analyses was used directly without regard to the sources of variance. 
The following table summarizes the above calculations and gives the 
certification statement: 


Number of Analyses N 20 

Mean of Analyses Xx 50 

Variance of Analyses S¢ 2500 

Tolerance Factor K 2.752 
(N, .95, .95) 

Computation X+K (N, .95, .95) St 188 


Certification: We are 95.0% confident that 97.5% of the 
material delivered contains no more than 
188 parts per million by weight of sulfur. 


We have met the specification in all respects. We must recognize, 
however, that the certification statement paints a conservative, rather 
than an accurate, picture of our out-going product's quality. This is 
because the variance used includes a component from measurement vari- 
ability. It has been our experience that chemical analytical measure- 
ment methods frequently give rise to variabilities as large as, or 
larger than, the inherent variability of the characteristic measured. 
Nevertheless, this total variability method is used whenever possible 
because of its simplicity. Unfortunately, it is'seldom that we are 
able to operate so far within specifications as to permit use of this 
method. 


The variance of analytical resuits obtained on a product will in- 
variably be composed of two components. These are (1) the inherent 
variance of the characteristic of the product to be measured, that is, 
the process variance, and (2) the variance of the measurement method 
itself. We term these variance oguponants rocess variance, Sf, and 
measurement variance, Sf- Then SS + Sé = sp. I have already mentioned 
that, more frequently than not, it is impossible to meet our specifi- 
cations using the total variance, S¢. 
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The other three certification methods to be described are therefore 
based on the separation of these variance components. That is, they use 
only the process variance which, after all, is really the only portion 
of the variance with which the customer is properly concerned. The certi- 
fication method which has been used most frequently is the variance sub- 
traction method. 


Briefly, the procedure followed is this. The total variance of a 
set of analytical results is easily obtained. The measurement variance, 
obtained from a laboratory quality control program, is subtracted from 
this total variance to obtain an estimate of process variance. The de- 
grees of freedom of this process variance is obtained by Satterthwaite's 
approximation method. 2) The process standard deviation, together with 
the tolerance factor, then provides us with a certification limit. Let 
us take an example using our fictitious element unium: 


Number of Analyses Ny kl 

Mean of Analyses X 0.7850 
Total Variance Sf - 000004 
Degrees of Freedom of No-1l 20 
Measurement Variance 

Measurement Variance Sh -000001 
Process Variance s¢ - Sh = S§ 2000005 
Degrees of Freedom of (36)° = N3-1 20 


Process Variance (s¥)* + (Sm) 


Ny -1 No-l 


Tolerance Factor 


K (N35, -95, -95) — 
Certification Computation &-K (yz, .95, .95) Sp 0-7802 


Certification: We are 95.0% confident that 97.5% of the material 
delivered contains at least 0.7802 grams of unium 
per gram of material. 


Thus we have s satisfactory certification of the product as accept- 
able using the variance subtraction method. It will be noted in the 
example that Sg is four times Sg. Suppose the measurement variance 
component is relatively larger in the example. We then might have: 











Number of Analyses Ny 41 





Mean of Analyses Xx 0.7850 
Total Variance sé - 000004 
Degrees of Freedom of No-1 20 
Measurement Variance 
Measurement Variance 3g «000003 
— —_— 
Process Variance sé Sm = Sp -000001 
Degrees of Freedom of (s§)@ <i 
Process Variance 5 >> 3 1 
(s€)° + (s&) 
Nyi-1 No-1 
Tolerance Factor Approaches infinity. 


Therefore in this variance situation no certification is possible 
using this method. With a little thought this result is understandable. 
The smaller the process variance to be estimated, the larger is our 
relative uncertainty of the estimate. In the mathematics this is indi- 
cated by the diminishing degrees of freedom. 


We have now had a look at the second certification procedure used 
by Y-12, the variance subtraction method. This is a useful tool in many 
applications but it fails, even on good product, when the measurement 
variance is large relative to the process variance. 


Our third method of certification we call the Chi-square method. 
This method has at times provided satisfactory certification statements 
for product which could not be certified by the variance subtraction 
method. The Chi-square method consists of placing an upper 95 percent 
confidence limit on the total variance and a lower 95 percent confidence 
limit on the measurement variance and subtracting these limits. The 
resulting difference is virtually an upper 95 percent confidence limit 
on the process variance. The standard deviation obtained from this 
process variance limit can be utilized together with the normal distri- 
bution statistic, 1.96, to provide a certification value that meets the 
requirements of the specification. Consider, as an example, the situ- 
ation last described in which the measurement variance was three-quarters 
of the total variance. 








Number of Analyses Ny 41 


Mean of Analyses x 0.7850 

Total Variance s¢ -000004 

Degrees of Freedom of No-l 20 
feasurement Variance 

Measurement Variance sé 2000003 

Upper Limit on Total Variance UL S¢ .00000593 

Lower Limit on Measurement LL sé 200000144 
Variance 

Upper Limit on Process Variance UL S§ 200000449 

Certification Computation X - 1.96 UL Sy 0.7809 


Certification: We are 95.0% confident that 97.5% of the material 
delivered contains at least 0.7809 grams of unium 
per gram of material. 


As you see the Chi-square method results in a legitimate certifi- 
cation statement using these data where the variance subtraction method 
failed. For this reason we have often used the Chi-square method. 
Nevertheless, this method, too, has built-in draw-backs. The principal 
difficulty is that in spite of increasing degrees of freedom the reli- 
ability of an estimate of variance does not increase very fast. Thus 
we have often found that the limits on our variances were too wide result- 
ing in an upper limit on process variance which was too large to permit 
certification of a product. You will note in the above example that the 
upper limit on the process variance is larger than the total variance 
with which we started. A few less measurements and certification would 
have failed. 


The methods I have described thus far are those that occurred to us 
immediately for meeting the specifications. I have mentioned the diffi- 
culties encountered in using each of these methods. Fortunately, we 
never failed to be able te certify a product we believed to be actually 
acceptable using these methods. But it was recognized that such a fail- 
ure was possible. 


Another consideration bears mentioning in connection with these 
methods. Although a mechanic were to select the wrong socket wrenches 
several times before finding the right one to turn a certain nut he 
would certainly not be accused of a lack of ethics. But, as you may 
have noticed, laymen tend to distrust statisticians. These certifica- 
tion methods have the peculiarity that if one procedure certifies the 
product, the product is good even though the other procedures failed. 
As in the case of the wrong socket wrenches the difficulty is in the 
tool used. Still, the statistician is open to the criticism that in 
order to certify sub-standard product he juggles certification methods 














until he finds one that will do the job. Obviously it would be desir- 
able to have a universally applicable certification method which, like 
a monkey-wrench, would fit any problem presented. 

We are indebted to Mr. A. de la Garza +) of our company for devel- 
oping what we believe to be such a method. The derivation of this method 
is as follows: 

Population variances exist such that 

(1) of = cf- o &, 

ag is the total variance with degrees of freedom, N+, 
os is the measurement variance with degrees of freedom, Ny, and 
as is the process variance. 

Select a function ¢ such that 

(2) Prob. (OF S$ SE -52) -1-c 

in which is small. 
Then 


(3) sa+O06 <4 


Se 
(4) os a) <¢ 





Observe that 


(5) sé is distributed as 72 
Th Nn 

(6) S€ is distributed as Ag 
N 

of t 








Substituting in (4) above, we have 


(9)} 2 + (« - 2) 
Xn hi <# 
——~— | /s— 


It will be seem that the expected value of the numerator of this 
expression is as follows: 


(10) [LS + (x - 1) 
EIN =1+(kK-1)=1 
K K 





and the variance of the numerator is 


(ar) [7g + (Ke - 2) vffa) 
Vv Tn oa m/ = 1 


K Ke Ke MN, 


Thus the numerator of the experssion is approximated by the distri- 
bution YW / Nm with degrees of freedom K° Mn. 


From (9) we now have 


(12) Ye/k? ny 


Y €/Ne 


This we recognize as Fisher's F so $ = F (K°Nm; Ne;ac ) and from (2) 


g 


» 
(13) Prob. Kn, 


E/N 
(14) Prob. a8 < F (KM; Nys;ee) SE - 5g | 1 -« 
L 


< F (K°Nm; Nt; o€ )]= 1 - ce 








(15) Prob. |) < VF (K2N,; Nysec)] sg - s2 |- l-« 


In this expression oy represents an upper 95 percent confidence 
limit on the process standard deviation which can be used with the 
normal distribution statistic, 1.96, to provide a certification. 
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Let us now use this method with the last set of data: 


Mean of Analyses x 0.7850 
Degrees of Freedom of Analyses Ny 40 

Total Variance Ss .000004 
Degrees of Freedom of Nm 20 
Measurement Variance 

Measurement Variance S$ «000003 
F (K°Nmq; Ne; -05) - 1.72 
Process Variance Limit os 00000388 
Certification Computation X-1.9% F) 0.7811 


Certification: We are 95.0% confident that 97.5% of the material 
delivered contains at least 0.7811 grams of unium 
per gram of material. 


It will be noted that this certification is in good agreement with 
that obtained using the Chi-square method which resulted in a certifi- 
cation level of 0.7809. An interesting variation in this method is 
possible if the ratio of total variance to measurement veriange is close 
to one. This ratio, if you remember, is K which appears as K“Ny, one of 
the degrees of freedom used in looking up the F value. If K is nearly 
one, the K® is also nearly one, and may be neglected which simplifies 
the calculations. In the case above if K° is neglected the certification 
level becomes 0.7809. This demonstrates that the K ratio does not have 
to be very close to one to make neglecting it possible. When K is neg- 
lected our certification statement can read "We are at least 95.0 per- 
cent confident, etc". 





Insofar as we have been able to ascertain this procedure using 
Fisher's F will certify as acceptable any truly acceptable product if 
a reasonable amount of data has been collected for the variance cal- 
culations. The broad applicability of this method will eliminate any 
suspicion that the statistician adapts his certification methods to the 
product. 
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SQC_CAN BE MADE TO WORK 





Thomas A, Budne 
Rath & Strong, Incorporated 


Quality requirements on manufactured goods in almost all indus- 
tries have steadily increased over the past years, As the supply 
surpasses the demand for products in some industries, competition 
creates strong pressures for better quality and for lower manufactur- 
ing costs, The current pressures for more reliable military products 
are well-known to all of us, A science of Quality Control has evolved 
and grown at a speed, at least sufficient, to meet all these challenges 
of higher quality standards, lower production costs, and more reliable 
product, Stimulated by the thinking of Walter Shewhart in his memora- 
ble book, "Economic Control of Quality of Manufactured Product" and by 
the new sampling concepts of Dodge and Romig, the control chart and the 
acceptance sampling plans became the nucleus for the new science, 
Tremendous technological progress in measurement equipment, *he estab- 
lishment of basic principles with regard to the quality control function 
and its place in industrial organization, and a gradual absorption of 
more and more of the existing and growing body of modern statistical 
principles and methodology have brought the science of Quality Control 
to its present stage of evolution, 


The tremendous contribution of the Quality Control science to 
modern industry is quite generally acknowledged and accepted, Most of 
us who have entered the Quality Control engineering profession can 
point with pride to one or more particular applications of SQC princi- 
ples and techniques which have resulted in many tangible benefits to 
our companies, We look with admiration at a few companies which have 
been using this new science to its full potential and have been getting 
exceedingly profitable results, 


It appears strange, then, that when asked to recommend a number of 
companies which might serve as examples of Quality Control progress, we 
find it difficult to add many names to a relatively short basic list, 

It would be interesting to note how many of us would add the names of 

the companies in which we have Quality Control responsibilities, The 
explanation must lie in the fact that while there has been some QC prog- 
ress in most companies, it is quite small compared to our estimates of 
what it could or should be, The obvious question is, "WHY?", If companies 
can make a better quality product and reduce their quality losses by set- 
ting up a good quality control program, why haven't most companies gone 
all out? 
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A long list of reasons can be offered, but standing out sharply at 
the top is: 


THE FAILURE TO ACQUIRE TOP MANAGEMENT INTEREST, UNDERSTANDING, 
APPRECIATION AND SUPPORT and, or because of, THE FAILURE TO PLAN A 
REALLY PRACTICAL QUALITY CONTROL PROGRAM WHICH CONVINCINGLY ESTABLISHED 
THE FACT THAT EVERY DOLLAR SPENT ON QUALITY CONTROL WOULD BRING ABOUT 
A SUBSTANTIAL RETURN IN THE REDUCTION OF EXISTING QUALITY LOSSES, 





If we define QUALITY LOSSES as all the costs and losses which would 
disappear if all quality problems were eliminated, we necessarily in- 
clude Procurement, Design, Engineering, Manufacturing and all activities 
on through to sales and customer acceptance investigations, Better 
quality, then, is unavoidable if Quality Losses are reduced, We shall 
limit our discussion here to a program for the reduction of quality 
losses in manufacturing, 


Experience indicates time and again that while most measured effects 
are actually cumulative effects of many causes, a small minority con- 
tribute to a major share of the total effect whereas the large majority 
of causes are responsible for a very small share of the total effect, 


When all the losses due to sub-standard product in a manufacturing 
operation are broken down by products, we generally find that only a few 
out of many products are responsible for more than half the losses, 


When all the variation in a product can be apportioned to each of 
the sources of variation, we find that a very few of the sources con- 
tribute most of the variation, 


If all the scrap in a machine shop were broken down by machine, we 
would generally find that a few of the many machines or one type of 
machine out of many types are responsible for most of the scrap loss, 


This basic principle is essential in planning a QC program in which 
every dollar spent must account for several dollars of reduction in qual- 
ity losses, We shall refer to it as the principle of mal-distribution, 
It clearly indicates that profit-making SQC activity cannot be spread 
over a whole manufacturing operation like a coat of paint, since quality 
losses are certainly not spread out that way, This mal-distribution can 
at times be so severe that as few as 10% of all the categories of qual- 
ity losses may account for as mech as 75% of the total loss, In one 
particular plastics film extrusion operation, one category of loss ac- 
counted for more than 50% of all the losses, An investment of QC dollars 
could - and did — bring a tremendous return, 


If the mal-distribution principle seems at times to be out of opera- 
tion in getting a distribution of the contributions to a particular ef- 
fect, it may be that the best categories of breakdown have not been 
selected, For example, in an iron foundry, a breakdown of total scrap 
by types of defect indicated an almost equal contribution of scrap by 
each type of defect, 














However, this was a job-shop type of operation involving short runs on 
several hundred different patterns, with about 100 patterns in each 

- week, A breakdown by patterns showed that 15% of all patterns accounted 
fer more than 2/3 of all the scrap, A QC program which succeeded in 
eliminating the one most prevalent type of defect could mean a total re- 
duction’ in: scrap loss which was less than the cost of achieving and 
maintaining this reduction, However, a QC program which succeeded in 
reducing the scrap for the one largest scrap-producing pattern each week 
could bring a substantially greater return, As a matter of fact, QC 
program which placed its major emphasis on the 15 largest scrap-producing 
patterns each week and on keeping new patterns out of this category suc- 
ceeded in reducing an average monthly scrap loss of $20,000 to approx- 
imately $13,000 in 4 months and to less than $10,000 in 8 months, 


Charts previously kept on a large number of measurable operating 
conditions in sand and metal had been ineffective in accounting for scrap, 
Their cost was extremely difficult to justify, The QC program led to the 
fact that only a few of the many controlled operating variables were 
really important in affecting scrap, and tight control of these few could 
be maintained at considerably less cost than previously expended on all 
the operating variables, The mal-distribution principle was in operation, 
A substantial reduction in inspection hours and in finishing hours and 
the value of over-all improved quality were not included in the scrap 
loss figures, 


Having selected the areas for profitable investment of QC dollars, 
a systematic plan of action is necessary, Directly or indirectly, most 
quality losses are a result of variation in a particular quality char- 
acteristic of the product, This variation may manifest itself in an 
attribute "good or bad” property or in a measurable property which varies 
too mich to remain within specified limits for acceptable product, We 
shall assume that the importance and validity of the specification and 
the ability to adequately measure the product against this specification 
have been established as a very important but nevertheless routine func- 
tion of the QC system, 


Since we recognize variation as a cumulated effect of all non-constant 
conditions in the manufacturing operation, we again seek to isolate the 
few non-constant conditions which we. expect will account for most of the 
total variation, We are not particularly interested in noting whether 
these are assignable or unassignable causes of variation, We are in- 
terested only in identifying them as the major sources of variation, In 
other words if the process capability accounts for almost all the varia- 
tion, we shall seek those conditions which most affect the process 
capability, If the process capability is a small part of the total varia- 
tion, we shall look for factors which would normally be called assignable 
causes, 


If repeated tests on the same piece of electronic equipment account 
for a large share of the variation, then causes of instability in the test 
method, the test gear, the test conditions, or in the equipment itself 
must be found, 








In a processing operation, the minute-to-minute differences may be 
small but the hour-to-hour fluctuations may account for most of the 
product variation, Factors which can possibly change from hour-to-hour 
would be sought, 





It is possible for us to estimate the portion of the total variation 
which is attributable to changes occurring within a unit or from unit- 
to-unit, or lot-to-lot of raw material, machine-to-machine, hour-to-hour, 
operator-to-operator, shift-to-shift, day-to-day, or any other reasonable 
category. Each manufacturing operation has categories which may be char- 
acteristics of itself, A foundry operation would add: cavity-to-cavity 
differences, cope-to-drag differences, mold-to-mold differences, batch- 
to-batch differences in sand, heat-to-heat differences in metal, etc, 





Small samples, selected at the appropriate times and treated graph- 
ically will supply the information we want, A control chart procedure 
can be adapted for this purpose, but the mlti-vari charts (I) are even 
simpler and more appropriate for this application, 


MULTI-VARI ANALYSIS 
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In the above illustration in Figure I, 5 measurements on each of 5 
consecutive pieces at every two-hour interval indicate alternative con- 
ditions, 

















The first shows large within unit variation as the major component of 
variation, the second shows time-to-time variation as the major compo- 
nent of variation, and the third shows unit-to-unit variation as the 
major contributor, 


At this point, experience with the operation may be sufficient to 
identify the non-constant condition which explains the picture given by 
the multi-vari charts, A simple one-factor experiment with the suspect 
condition set at two or three levels will provide the objective evidence 
required, If two or three alternative conditions are strong suspects, 
their effects may simultaneously be evaluated by simple experimental 
designs, If the correct factor has been isolated, the experiment meas- 
ures its effect quantitatively, 


The control chart, the multi-vari chart, or some similar technique 
provide a systematic first step, Experience, horse sense, and simple 
designed experiments may sometimes suffice to finish the job, More often 
than not, however, the easily identifiable trouble sources have long since 
been isolated and kept in check, There may now be as many theories as 
there are variables; perhaps 15 to 20 of them including their interac- 
tions, which could explain the major category of variation identified in 
the multi-vari charts, The usual designed experiments fail us by becom- 
ing impracticably large and complex, Ten factors at 2 levels of each 
require 1024 test units, an impossible situation, 


The recent development of the Random Balance Experiment (2) by Frank 
Satterthwaite provides a major break-through by which the 20 factors may 
all be surveyed in as few as 30 test units, Underlying the successful 
application of Random Balance techniques, we again find under the mal- 
distribution principle, that only two, three, or four of the 20 factors 
under examination are responsible for a major share of the total vari- 
ation, 


In a chemical company, manufacturing a variety of plastic materials, 
the largest quality loss was attributable to one particular quality char- 
acteristic of one of the products, Multi-vari charts indicated that the 
greatest variation occurred from hour-to-hour, This could be explained 
by any of 12 operating parameters including several temperatures, percent- 
ages of ingredients, speeds, etc, Six of the 12 were used by the opera- 
tors to maintain control of the process, The recorded history of the 12 
variables measured indicated their normal range of operating conditions, 
Each of the 12 was programmed into a Random Balance Experiment of 30 test 
runs with the two levels of each set at the high and low values of their 
normal operating experience, The results showed that only two of the 
suspected six, along with one of the unsuspected 6, were critical in their 
effect on the product, A considerable portion of the variation which still 
remained was finally explained by instability in the gages in the process- 
ing equipment, By maintaining rigid specifications of operating conditions 
for these three variables, improving instrumentation, setting-up well- 
defined rules for regular calibration and the elimination of operator judg- 
nog making adjustments, 10% quality losses on the product were reduced 

° ° 





Briefly described, the Random Balance Experiment requires the 
randomization of an equal number of each of the levels of each of the 
factors among the 30 runs (or other specified number of runs), 


SPECIFICATION OF TEST CONDITIONS 
IN RANDOM BALANCE EXPERIMENT - 
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Figure II 


Variable I has 15 High levels and 15 Low levels in randomized order, 
Variable II has 10 High levels, 10 Medium levels and 10 Low levels in 
Random order, Variable XX has 10 each of different lots A, B and C of 
one of the raw materials in Random order, 


The results are quickly surveyed in a series of scatter diagrams 
for each input variable against the measured output characteristic, 
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It is readily apparent that the difference between the High and the 
Low levels of Variable I contributes nothing to the total variation, 
The change from High to Medium to Low levels on Variable II ~ake a def- 
inite contribution to the total variation and similarly for Variable III, 
Thus, less total variation is possible if Variables II and III are re- 
stricted within narrower operating limits, 


The illustration shows the technique stripped down to its simplest 
elements, While it is somewhat more involved in design and in analysis, 
it is a basically simple and attractive approach which quickly gains the 
confidence of both management and operating level personnel, 


One final step may be required in establishing the optimum operating 
conditions for identified critical variables, The Box technique of ev- 
olutionary operation (3) is a sufficiently simple approach which can be 
understood and appreciated by supervisory personnel, If two critical 
factors require optimization, small changes in these factors in series 
of 2 x 2 designed experiments will indicate the direction for optimm con- 
ditions until they are finaily reached, 
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When the systematic activity has removed the largest source of 
quality loss and the second largest and so on down the line, a point must 
finally be reached at which the reduction of the next largest source of 
quality loss would no longer justify the expenditure of QC dollars, If 
the major causes contributing to quality losses, as uncovered, are placed 
under a reliable system of operating or process control, and if specifi- 
cations on operating conditions are written or re-written to include the 
important discoveries, an economic quality level once attained can be 
maintained at relatively low QC costs, 


A complete QC program, as mentioned above, must begin at the earliest 
conception of the product and follow through to the consumer, The science 
of Quality Control is sufficiently advanced to supply equally effective 
programs in every key quality area, QC losses on new products would be 
at a minimum if effective QC planning were brought in at the ground floor, 


When the quality losses are high because of the absence of a complete 
and effective QC system, Management will not disregard a program based on 
facts in dollars and cents, but will support one which makes sense as well 


as dollars, 


This is the kind of program which must and can be sold to top man- 
agement, 
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QUALITY CONTROL STANDARDS FOR LABORATORY PRECISION 


George T. McCollough, Jr. 
Wyandotte Chemicals Corporation 


Controlling continuous processing operations and the qualities of 
flow-type products have created many unique problems for the chemical 
industry. Special measures of quality are often required that are dif- 
ferent from those found in other industries. The filling line for med- 
icinal packaging, or the canning and drumming operations for other chem- 
ical products can be evaluated by the standard quality control methods, 
but the qualities of the bulk material being fed to these end-use pack- 
aging operations may not be so readily measured or controlled. 


The chemical plant operator has long been aware that frequently 
chemical laboratory analyses left something to be desired. Sometimes 
he feels frustrated by the fact that the same raw materials, equipment 
and procedure today produces a product that rates a different quality 
result from the control laboratory than did the product of these same 
factors yesterday. At the same time, the laboratory chemist, equally 
aware of the same deviations from expected results, can find many 
reasons why samples can be variables. Phantom causes for differences 
are created and many misconceptions as to the real value of certain 
procedures, raw material qualities or other factors affecting the prod- 
uct may arise. In short, the precision and accuracy of chemical analy- 
ses have often been misunderstood and usually viewed more favorably than 
justified by reality. 


Chemicals are characterized by their frequent occurrence in large 
masses, either as a solid, liquid or gas. Both batch and continuous 
stream production are employed as well as combinations of the two. More 
often than not a batch of chemical, and particularly if it is a one 
phase liquid, will be homogeneous within itself, thus making the test 
precision the only variable pertinent to its evaluation. On the other 
hand, continuous production streams may be varying constantly, intro- 
ducing the sample as a potential source of variation. While standard 
sampling procedures have been developed for many chemical commodities, 
there is much room for improvement in this important area. The develop- 
ment of appropriate sampling plans for the sampling of heterogeneous 
chemical streams or masses often requires special adaptation to each in- 
dividual situation. 


The customary measures of quality are augmented in chemical quality 
control by many more abstract measures such as purity, solubility, pH, 
density, and the like. Many chemical product specifications contain re- 
quirements that special highly empirical tests be passed. Frequently 
these are attempts to duplicate or estimate in some way the ultimate 
end-use performance of the product. Such tests as dustiness indexes, 
foam height, toxicity, thermal stability and corrosivity are examples of 
these. In these empirical types of testing, the results obtained are 
usually only meaningful, as they can be compared on a relative basis to 
the results on other samples or standard materials using exactly the 
same procedures. Much of the quality evaluation is of the "variable" 
type with very little of the "go or no go" technique employed. 
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Sometimes time lag introduced by the test procedure is an important 
factor in complicating the operator's problem of controlling the 
process. This is particularly true where the material being checked 
must continue on to the next processing step without waiting for test 
results and adjustments as a result of the test. In rare instances, the 
time to run a test can extend to several days. As a result, more em- 
pirical relationships are sought to set up predicting tests which will 
reduce the risks created from the long test delay times. The precision 
of such tests as related to the longer test creates new opportunities 
for variance to enter the picture. 


Many chemicals are mixtures containing a relatively high value 
material diluted with a low value material. For example, in certain 
grades of sodium carboxymethyl cellulose, an important ingredient in 
many laundry product formulations, the active agent content may account 
for only two-thirds to three-fourths of the physical weight of the 
product, but will account for 96 or 97% of the product's cost. In such 
cases, sale of the product may be based on a guaranteed minimum content 
of the higher priced component. The manufacturing cost of the product 
may be a direct function of a test result. The producer in giving sound 
value wants to be sure he meets his guaranteed minimum, but he probably 
will not be paid for the margin of safety he includes. Thus, the qual- 
ity of the laboratory test evaluating the mixture may make a significant 
difference in the cost of the product to him. Occasionally the price of 
the product to the consumer will be calculated from a formula using an 
analysis result as a factor in computing the price. For example, 50% 
liquid caustic soda is sold on a basis where the unit price of each 
shipment is directly proportional to the sodium oxide content of the 
caustic liquor. Whenever pricing is directly based on the laboratory 
analysis as in this example, the precision of the analytical procedure 
becomes extremely important. Smooth vendor-customer relations can be 
difficult to maintain when the precision and accuracy of the test method 
are not well understood and agreeable to both parties. The frequently 
difficult question of "between laboratory" variation can enter the 
picture at this point, because the producer typicaily must pass the 
customer's test as the customer runs it, thus he must be concerned not 
only with the precision and accuracy of the test in his own laboratory, 
but also its precision and accuracy in the customer's laboratory. 


Historically, the laboratory chemist has armored himself against 
criticism by running duplicate or triplicate analyses. Typically, these 
were run from the same sample simultaneously, with identical manipula- 
tion and by the same operator. If good agreement was reached, that 
proved to him that the results were valid and that no errors had been 
made. The possibility that all the replicates were wrong was, as a 
rule, overlooked. Plant operating personnel have often resorted to 
taking new samples for recheck by the laboratory when first results (and 
their replicates) were not in the expected range. A better value report 
on the new sample gave everybody some comfort - the laboratory man 
blamed the sample for the differences and looked no further, and the 
plant man had the desired answer from the "better sample. The fallacy 
possible in this situation is obvious to us now that we are more 
sophisticated as a result of statistical quality control, but it was 
largely unrecognized until recent years. I might say that in practical- 
ly every case that I am familiar with, the variance of the chemical test 
procedure is understated by the results of replicate determinations made 
under the conditions mentioned above. There is validity in this approach 
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to reduce the variance of test data, but only when all the random fac- 
tors affecting the test results are encouraged to be present within the 
system of replicates rather than be eliminated. This may be accomplish- 
ed by running the replicate sample by a different person, or in a dif- 
ferent laboratory, or on a different day, or a combination of these fac- 
tors or others suitable to the particular situation, 


Overconfidence in the results of laboratory tests on the part of 
operating personnel can lead to overcontrol of the process. This can 
have serious consequences for both cost and quality. Many continuous 
processes have optimum operating conditions, short of which or beyond 
which efficiencies decline. If tests measuring the relationship of the 
process to an optimum condition are not precise, they may indicate an 
adjustment when in reality, none is required. This can lead to a 
cycling effect which will effectively spread the process performance so 
it averages at less than the desired optimum. Also, many such adjust- 
ments consume labor which could be better used elsewhere. A process 
that is out of control will create need for more laboratory services and 
frequently cause off-specification production creating more unnecessary 
costs. 


Summarizing to this point, it is a fact that many problems in chem- 
ical processing are created as a result of test imprecision, and that 
the beginning of wisdom in chemical plant control lies in a clear under- 
standing of the nature of the testing system as it is related to the 
process objectives that must be met, 


Ordinarily, the standard required for a test's precision must be 
looked on as a derivative of other important factors. First to be con- 
sidered are the specification limits within which the product must fall, 
and, secondly, the nature of the real process variability. Both must be 
in terms of the test being considered. 


There are a number of interesting aspects to the process of arriv- 
ing at specification limits, many of which are quite unscientific and 
unrelated to the vendor's ability to produce or test, or even to the 
customer's real need. However, no matter by what process, specifica- 
tions are set and they are the first reference points for establishing 
the requirements for test precision. 


Examination of the total variance of the system is required. The 
total variance is that computed from all of the test results on routine 
production, Abnormal results arising from process upsets due to assign- 
able causes should be eliminated from the estimate of total variance. 
Resolution of the total variance into its components follows - how much 
is from real process variation, how much is from sampling, and how much 
is contributed by the analysis procedure? To be reasonable, the speci- 
fication must allow at least enough room for an amount of real process 
variation consistent with the capability of the process. The difference 
between the variance allowed by the specifications and the amount that 
will be needed for the real process variation is the maximum that can be 
tolerated in the analytical system and sampling. This relationship is 
expressed in the following equations. 
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Equation 1 
2 2 2 . a 
(Spec. Range/6) 2 (F =Tp ~~ as ? 


2 
(Spec. Range/6) = Total variance allowed by specification range. 
oF = Total variance of system as observed from analytical 
results on product samples. 
Op = Real process variance which estimates the true varia- 
tion of product quality about its mean value. 


a Variance of analytical procedure which estimates the 
tendency of the individual test result to vary from 
the true value of the sample analyzed. 


Ts = Variance introduced through sampling which estimates 
the tendency of the individual sample to vary in 
quality from the true quality of the material it was 
taken from, 


Real process variation must be stated in the terms provided by the 
test method, Real process variation is estimated by the difference be- 
tween the apparent total variance and the amount of variance that can 
be attributed to sampling and the analytical system as in this equation. 


Equation 2 
2 - OF az" +57) 


Since the total variance of the system (oF) can be estimated from 
operating data and the determination of variance due to sampling (g3* ) 
can be estimated by an experimental procedure and since both must be 
stated in terms of the test result, the determination of a practical 
value for the variance of the test (7) becomes the cornerstone upon 
which the understanding of our process system is based. 


Upon inspection of Equation 1, it can be seen that in order to 
evaluate the total variance of the system as related to the potential 
variance allowed by the specifications, the sum of real process, sampl- 
ing, and test variances must be considered. After the values of these 
have been determined, inspection of the equation will suggest whether 
any action is necessary, and if so, what direction it may most profit- 
ably take. 


An estimate of the variance arising from sampling (@3*) can be made 
by selecting several samples from a typical batch or increment of prod- 
uct and analyzing each several times. Analysis of variance into within 
and between samples will provide a method for attaining the value re- 
quired, The design of this experiment must be considered carefully in 
order that a sufficiently precise estimate of sampling variance is ob- 
tained to make the term usable in the total variance equation. Care to 
replicate the analyses in such a way that full value is given to the 
within samples variance is important. The inclusion of more than one 
batch or increment of production may be advisable. Such an experiment 
will also provide an estimate of the test precision Ge’) which can be 
used in the over-all equation. In many cases, experience will indicate 
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that the sampling variance will be nearly equal to zero. This is 
usually more likely in liquid systems, and in such cases, a different 
approach to estimating test variance G” can be used, 


Considerable thought should be devoted to the relationship of the 
method of selection of the sample to the character of the process. A 
flowing fluid, for example, may be varying continuously over a certain 
range. Instantaneous samples taken from this stream will reflect the 
complete range of variation at that point in the system. If a surge 
tank is in the system downstream from the sampling point, mixing will 
occur and samples from this tank will reflect less variation than was 
present in the samples of the stream feeding into the tank. Similarly, 
drip samples or composite samples from the flowing stream will deaden 
the effect of the variability in the stream. Since the control we are 
seeking is on the product of the process, the ultimate size of the prod- 
uct unit, which could be anywhere from gallon cans or less to tank cars 
or larger bulk storage units, should be considered in deciding on the 
sampling plan. It is easily possible for the sample variability to be 
less than that among the final product units of the process. To this 
extent, true process variation will be concealed, and we will obtain a 
falsely optimistic estimate of the probability of the product unit pass- 
ing the customer's acceptance inspection. 


The most satisfactory situation will be obtained when the samples 
analyzed are as truly representative of the product and its variability 
as possible. This will tend to give ag;" approaching zero and simplify 
the over-all problem by eliminating a complicating variable. 


When the real process variation has been established, the need for 
improvements in the analytical method can be evaluated. If the real 
process variation is within the specification limits, but the variance 
contributed by the analytical method is enough to cause some good prod- 
uct to be rejected or marginally bad product to be passed, method im- 
provement is a possibility. A realistic appraisal may reveal that 
satisfactory improvement is possible. On the other hand, variance con- 
tributed by the analytical procedure can be reduced by obtaining more 
than one determination on the sample. While care must be taken to avoid 
the pitfalls of "replicate" testing, variance reduction up to the maxi- 
mum expected from the formula 


2 
Ga / 
a> 


can be achieved. Because of the fact that time is usually critical, 
perfect conditions for arriving at independent estimates may not be 
achieved enteey eith be larger than might normally be thought. This is 
why this equation is written as an inequality rather than in its usual 
form of an equality. 


If sampling variance is significant, the additional tests may be 
run on additional samples which will tend to bring the average analysis 
still closer to the true product value. The number of samples and 
analyses per sample can be selected so as to give the desired probability 
of error in deciding whether to accept or reject the product. 
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If the real process variation is such that it exceeds the specifi- 
cation limits, obviously some bad product will be produced and the 
quantity of bad production can be estimated, Here again, inspection of 
the variance contribution of the various elements will give a clue as 
to the way to resolve the situation. For instance, a careful examina- 
tion of the basis for the particular numbers used as specification 
limits may be advisable. Sometimes, when it becomes obvious that the 
existing process is incompatible with the product specifications, a 
change in specification can be arranged satisfactorily. Where this is 
not possible, and frequently it is not, the need to be exact in separat- 
ing bad from good product should be evaluated and agreed to by the in- 
terested elements in the business. This decision should be based on a 
knowledge of the test precision, the cost of improving it, the cost of 
rework of bad product, the possible long run effect of marginal quality 
deliveries on the long run sales picture and other factors pertinent to 
the particular problem. The sampling and analysis plan should be so de- 
signed as to obtain the agreed upon degree of exactness in appraising 
the product. 


The need for process improvement can be better understood when the 
real process variation is known as distinguished from the observed total 
process variation, Usually the cost of reducing process variation in- 
creases at some rate above that which a linear relationship would indi- 
cate, The probable cost of improved process stability will be more 
realistically viewed when the contribution of analysis error to the 
total observed variation is understood, 


The standard for control laboratory precision must be considered 
on a case-by-case basis. The facts of each situation when understood 
will dictate the precision required for the analytical procedure in- 
volved, Precision can be improved*basically by one of two methods - 
getting a better test, or running more results per sample. Which course 
is the better to follow will be principally a function of the cost of 
the test and the technology available for improvement. 


As a continuing technique for evaluating and controlling test pre- 
cision or variance we have used the following simple program. Samples 
run by the analyst are commonly set aside as "retainer" samples for re- 
checking should some future development indicate a need for re-analysis. 
The laboratory leader feeds back to the analyst at intervals an unknown 
sample which is in reality a portion of the "retainer" sample he has 
previously tested. His recheck analysis is made without his knowing 
which of the samples he previously tested he is running, though he may 
realize he is checking himself. A graph of the range of the original 
and recheck result is maintained in the laboratory. This is kept as a 
modified range chart with plus and minus values being plotted. (See 
example Figure 1.) Plus and minus values for the range are obtained by 
subtracting the recheck result from the original and keeping the sign of 
the answer, R-bar for computing control limits is obtained by averaging 
the individual ranges as if they were all plus values, as they ordi- 
narily would be considered. The usual upper control limit for range, 
for sample size equal to two, is charted both as a plus and as a minus 
value. From R-bar, the variance of the analytical method is estimated. 


Figure 1 


DETERGENT COMPOSITION 


ORIGINAL RESULT MINUS RECHECK 
RESULT 





In addition to providing the needed estimate of the analytical 
variance, this type of test rechecking and charting have provided in- 
teresting trouble-shooting values, and their maintenance is primarily 
justified as a laboratory test quality control device. Figure 2 shows 
a chart prepared from the original data taken through this recheck pro- 
cedure on the method shown in Figure 1. Note that initially the limits 
for range were very wide compared with the later time period of the 
chart. The reason for the improvement in precision in this case was the 
result of investigation and modification of the test method. This im- 
provement was mandatory because the strength of a detergent product was 
being controlled by it. The variance of the analytical method was much 
larger than that allowed in Equation 1 when allowance was made for real 
process variation inherent in the system. 
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Another example of a problem brought to light by the recheck pro- 
cedure is illustrated in Figure 3. Here, an obvious bias to the plus 
side indicates that the A.P.I. gravity value obtained from the sample 
is usually lower on rechecking than was obtained on the fresh sample. 
Such a consistent bias could come from product change on aging, but in 
this instance, the original sample was usually tested for gravity while 
hot and corrected to a standard temperature while the second test was 
run at room temperature. The standard correction value for temperature 
was found to be slightly in error which accounted for the difference. 


Figure 3 
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In most cases, an operator rechecks his own original work. 
Plotting all operators' results on the same chart helps to point up the 
quality of the individual's performance. Statistically, this intro- 
duces the possibility that between operator differences exist that are 
not reflected by R-bar. Where this is suspected, cross-operator re- 
checks will rectify the statistical estimate of variance, but will 
limit the usefulness of the chart as a control of individual operator 
performance. 


In summary, because of the considerable possibility for variance 
in chemical laboratory testing, establishing the standards for the pre- 
cision required of a test method will require the evaluation of several 
other factors. These are: 

1. Am analysis of the specifications established for the product. 


2. The observed total process variance. 


3. The relative contribution to observed total process variance made 
by real process variation, sampling and laboratory analysis. 


4. The determination of the relative "fit" of the variance obtained 
in practice and that allowed by the specifications. 
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5. The exactitude with which acceptance or rejection decisions are to 
be made on product approaching specification margins. 


6. The scale of the penalties both short and long range that will 
accrue as a result of making incorrect quality decisions. 


305 




















STOPPING RULES FOR CONTINUOUS PRODUCTION 


R. B. Murphy 
Bell Telephone Laboratories, Inc. 


In continuous inspection poor quality over an extended period may 
lead to more inspection effort than is desired. The producer, consumer, 
or both may feel that economic and quality interests could be well 
served at such times by resort to a special action aimed at improving 
the process. There has been considerable interest recently in providing 
criteria based on inspection data to determine when such action should 
be taken. Such criteria have been called stopping rules because in many 
cases production is stopped when the special action is called for. 


The present discussion concerns four such stopping rules as applied 
to Dodge's original CSP-l1 plan, in which inspected units must be classi- 
fied as either "defective" or "nondefective". The basic plan starts 
with inspection of each unit made in succession until a specified number, 
i, of consecutive units are found nondefective. A sequence of units so 
inspected is called a screening sequence. After the initial screening 
sequence is ended, the inspector samples a fraction f of the units pre- 
sented to him. He continues to sample until he finds a defective unit. 
At this point screening is resumed, and the procedure is repeated in- 
definitely. The inspector rejects (or sets aside for correction) any 
inspected unit found to be defective and accepts all others. 


Of interest are the practical aspects of applying the four stopping 
rules as well as operating characteristics of these rules which may 
serve as a guide to selecting a suitable version for application. 
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TABLE I 


STOPPING RULES FOR CSP-]_ PLANS 


Rule (n* - i): Stop as soon as a defective unit is found in any one 
screening sequence after the sequence has exceeded 
n* - i units. 


Rule (r): Stop as soon as a specified number r of defective units 
are found in any one screening sequence. 


Rule (N,R): Stop as soon as a specified number R of defective units 
are found in any block of a specified number N of in- 
spected units. (Blocks do not overlap.) 


Rule (n*): Stop as soon as a specified number n* of units have been 
inspected in any one screening sequence without ending 
it. 
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TABLE IT 
D 10 OTATI 
The probability that a defective unit is produced = Process average. 


ae) 
i] 


1-p. 


a 
" 


i = The number of nondefective units produced and inspected in a row 
before sampling may be used. 


La) 
i] 


The average fraction of output inspected when sampling is used. 


4 
" 


The total fraction of units inspected. 


PNQL = Producer's Nominal Quality Level, a value of p chosen with the 
AOQL and F to fix f and i. 


T,, = The probability that no run of i nondefective units has occurred by 
the time n units have been produced. (NOTE: For brevity the ex- 
pression for T,, is omitted. ) 


yr 
t 


= (1-Tye)/(1-qi tye - Tp¥+4) = Correction factor for characteristics 
of Rule (n*). (See Table IV.) 


D, = (1 - q*)? = The probability that no run of i nondefective units 
has occurred by the time r defective units have been 
produced. 

2 M\_m nm 
B(x) = = (DP q = The probability that no more than n units have 
m=x been produced by the time x defective units have 
been produced. 

Stopping cycle = The elapsed production period from one time a given 
stopping rule is invoked until the next time it is 
invoked. 

Counting cycle = The elapsed production period from one time defectives 
begin to be counted until the next time they begin to 
be counted in accordance with a given stopping rule. 
(Applies only to Rules (r) and (K,R).) 

eo 

m, = The (variable) number of units produced in a stopping cycle. 

a = The (variable) number of units inspected in a counting cycle. 

E(m,) = The average of m,. 


E(m,) = The average of My 
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* 1D STOPPING R 
Specified 
AOQL 1% 1% 5% 5% 
PNQL 005 005 025 025 
F at PNQL 10 225 10 225 
E(mp) at PNQL 104 104 104 104 
CSP-1 Plan 
150 70 30 15 
f 05 +20 05 +20 
Actual 
AOQL 1% 1% 4.8% 4.6% 
PNQL 005 005 025 2025 
F at PNQL 1004 2620 1011 ©2677 
n* 251 127 79 36 
n® - i 101 57 49 21 
E(mp) 10, 920 11, 520 10, 000 6, 877 
Rule (r) 
r 2 2 4 3 
E(mp) 10, 900 11, 290 9, 701 6, 707 
E(mmg) 305.8 259.2 78.64, 56.63 
Rule (N,R) 
N 400 175 97 LL 
R 3 3 5 4 
E(mp) 11, 990 11, 260 9, 737 6, 821 
E(me) 356.7 172.1 94.99 43.78 
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QUEUEING THEORY AND SOME OF ITS INDUSTRIAL USES 


Roger I. Wilkinson 
Bell Telephone Laboratories, Inc. 


Delays in giving service occur whenever in any process or systen, 
the momentary demand exceeds the available capacity of the serving 
personnel or equipment. Familiar examples are people waiting in the 
checkout line at a supermarket, at the railway ticket window, and in a 
doctor's office, Automobiles waiting to cross arterial highways and toll 
bridges, airplanes seeking to land at busy airports, and ships awaiting 
dock space to unload are examples of costly and critical delay situations 
in the transport field. In industry there are such diverse waiting line 
problems as the flow of materials ami machinery to assembly and test 
positions, the extent to which inoperative machines should be required to 
wait for repairs, and the economic backlog of typing work in a central- 
ized stenographic department. In all of these the waiting "customers" 
comprise a queue. Queueing theory is the study of the formation of such 
customer waiting lines ami the characteristics of the delays they 
experience. 


The earliest mathematical studies of delays were made by A. K, 
Erlang of the Copenhagen Telephone Company; he published his theories 
over a period of years beginning in 1 otil. Other well known contribu- 
tors to the field include E. C. Molina[ 2], T. C. Fry, C. De Crommelin [3], 
F. Pollaczek (4), C. Palm and J, Riordan, all of whom were strongly 
concerned with the flow of traffic through telephone switching systems. 
Since World War Il the techniques of queueing theory have become an 
important tool of the workers in Industrial Engineering ami the popular 
Operations Research field, Hundreds of articles have now been written on 
queueing theory ani its applications. 


A short derivation of the Erlang formulas is given in the Appendix. 
The assumptions made in this analysis probably correspond more closely 
than any others to the preponderant mumber of applications so far made in 
industry. References 5, 6 and 7 give bibliographies of papers containing 
the development of theories and formulas for a very wide variety of 
practical and theoretical situations, T. L. Saaty[ 8] has recently made 
a résum& of useful queueing formlas; and P, M, Morse[9] has provided a 
readable and comprehensive analytical approach to general queveing 
problems. 


Elements of the Queueing Process 


In the upper part of Fig. 1 are shown pictorially the principal 
elements required to establish a queueing situation, At the left are a 
number N of Sources whose Demands arise at successive instants of time, 
These comprise the Input Process*., If, when the demanis reach the 
Servers - assumed to be c in mmber = all servers are found busy, the 
demands will normally wait, along with any others already there, until 
their service moment arrives. When a server becomes available, a choice 
is made of some one of the waiters to be accommodated. The particular 
manner of making this choice is called the Queue Discipline. There will 


*The terminology used Here is considerably based on that of D. G, Kendall. 
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then be a Service Time during which the demand is satisfied, following 
which the demand is dismissed. The point in time character of the dis- 
missed demands is called the Output Process, The latter is of particular 
interest when the output is all or part of the input to a second queueing 
system, 
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Fig. 1 - Display of Queue Elements, and Common Assumptions 


In the lower part of Fig. 1 is a table displaying the most commonly 
encountered choices for the several elements of the queueing process. 
Under Input Process the selection of N very large, or infinite, simpli- 
fies the analysis since then the arrival rate of demands is unaffected by 
the number of sources already engaged in waiting or being served, i.e., 
the demand rate is constant. 


The Character of Arrivals may be highly diverse. The two simplest 
cases are those where the demand instants are at random (their inter- 
arrival times have an exponential distribution), and where they arrive at 
regular intervals (constant interarrival time). The most general assuup- 
tion would of course leave one free to select any desired distribution of 
interarrival times, 


In delay systems involving people, first-come-first-served is 
commonly accepted as the fairest rule by which successive selections are 
made from a line of waiting customers. However, attainment of this 
Queue Discipline may require substantial expense, In this event a random 
choice for the next-to-be-served from among those waiting may occasionally be 
acceptable, Certain classes of demands (customers) may merit Priority 
over others who have arrived earlier (e.g., emergency telephone calls), 
One can imagine circumstances in which last-come-first-served might be 
observed (cf. a new baby in the house!), Customers served in Batches are 
not uncommon (buses, taxis). Defections from the queue may occur if the 
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delay becomes too great, or refusal to enter the queue (Balking) may be 
the result of seeing too long a waiting line. 


The distribution of Service Times is critical in determining the 
distribution of delays experienced by the demands. It is very common in 
real life (e.g., repair of machinery, retail castomer purchasing, tele- 
phone conversations) to find Exponentially—distributed Service Times, 


o(t)dt = 3 e-t/hat (1) 


Again, in many machine operations a Constant Service Time occurs. More 
specialized service time distributions are sometimes described by 
Pearson's Type III curve, while, of course, a general frequency function 
will describe service times of any character [1],[5]. 


In the last column of the table of Fig. 1 are given several 
historical references of practical importance to appliers of queueing 
theory; at the end of a line associating assumptions made in successive 
columns, is the author's name and the date at which he published his 
findings. Further details are found in the list of references. 


Criteria of Delay Service 


Depending on the circumstances, interest in the results of the 
queueing process may take different forms. 


The Probability of Delay, POO), is the simplest index of delay 


service. In instances where the penalty is great for being delayed at 
all, selection of a small acceptable value of POO) may suffice to 
engineer or administer a system. Thus a delay in supplying a necessary 
part in an assembly line might cause a process to stop with attendant 
confusion and large costs of starting up again. 


ity o De Greater than t, P » will often be required 
especially if some secondary event tends to occur as delays exceed cer- 
tain critical amounts. Waiting lines of customers, who begin to complain, 
or even defect, if the delays seem unreasonable, are of this sort. Again, 
airplanes with limited fuel which encounter landing delays have a strong 
interest in the probability of delays beyond certain limits. 

Av D t, is required to be known when the aggregate of many 
delays, rather than the magnitude of any individual delay, is important. 
Thus a factory manager might want to estimate the total lost machine time 
(and hence production), due to machine stoppage when repair men or 
facilities are not always immediately available (due to being engaged 
with other stopped machines). 


° ty o eue Length Exc » will be desired when 
special provision for ng the queue is required. Thus the nunber of 
seats in a waiting room, or buffer storage units in a computer, will need 
to be sufficient to strongly-insure that no more than a very few demands 
are turned away. 


Average Length of Queue, a> is needed to estimate the aggregate lost 
time of waiting demands. If employees waiting for a within-the-business 
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service, average too great a delay time, it may be economical to hire 
more servers ami reduce the average length of queue. 


thematical Formul 


Let a = average simultaneous demands being served (if none defect) 


= mumber of demands per average service time h 


Ni bid er hour Av ervice time per bid in sec (2) 
00 


(Note that the unit of load "a" is appropriately called the erlang.) 
The "occupancy" level, or load per server, is then defined as G = a/c, 


For random (Poisson) input, no defections, and exponential service 
times (and very closely so for constant service times), the probability 
of a particular customer being delayed, i.e., meeting a delay greater 
than zero = as derived in the Appendix, eq. 13 - is given by 





P60) « (ace"®/c}) [c/(cma)) (3) 
r= BF aXena/x! + (acen8/ct) [c/(cma)] 
x=c 


Values of P(>0) are graphed on Fig. 2 (taken from[10]), whose 
abscissa is load a; the numbers on the curves correspond to the numbers 
of servers c. For example, a load of a = 3 erlangs offered to five 
servers results in a probability of delay of 0.236, that is 23.6% of 
customer demands can expect te find all servers busy am thereby have to 
wait for service. 


Delays Greater Than t, Exponential Service Times 


The probability Pt) of a customer demand being delayed longer than 
t is the product of two probabilities: (Probability of being delayed 
at all) X (Probability, when delayed, of being delayed longer than t). 
This may be written 


POt) = POO) F(u) (4) 


where u = ct/h, Here the presence of u clearly introduces the delay time 
t, the number of servers c ami average service time h into the solution; 
one would expect the result to depend on each of these factors. P(>0) 
values are read from Fig, 2. For first-come-first-served situations, 
values of Fo(u) = e~(cm8 th - derived as equation (20) in the Appendix - 
are read from Fig. 3. For cases where waiting customers are served in 
random order, there is no corresponding simple expression for FR(u); 
approximate values, however, may be read from Fig. 4. 


Example 1: Delays at Airplane Ticket Counter. If in the average busy 


hour 48 customers purchase airplane tickets at a two-position sales 
counter, what proportion will have to wait more than 5 minutes? 
Suppose service times are roughly exponential and average 2 minutes. 
Assume (a) first~come-first-served, ami (b) customers served at 
random, 
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Sozutton (a) Queued Service. a = nh/3600 = (48)(120 secs.) / 
OO secs = 1.6 erlangs; © = 1.6/2 = 0.3, From Fig. 2, PO) = 
0.71. From Fig. 3, using u = ct/h = (2)(5 min.)/2 min, = 5, read 
Fo(u) = 0.37, whence PoO5 minutes) = (0.71) (0.37) = 0.26. 


Ai eA certs, Same as (a) except from Fig. 4 read 
Fp (u) = 0,27. Then Pp 5 minutes) = (0.71) (0-27) = 0.19. 


As suggested above, the probability of finding both ticket 
sellers busy (0.71) is imiependent of the order in which waiting 
customers are served, This however is not the case when delays of 
some specific magnitude are considered. In the present example a 
third fewer customers must wait longer than 5 minutes if they are 
served at ramiom! At 10 minutes, the same per cent would wait; 
beyond 10 minutes, more would wait than if queued service were 
maintained, It will be seen that with ramiom service more very 
short and more very long delays occur, Thus the length of delay at 
which the criterion of acceptability is set, may have a considerable 
effect on the selection of the best queue discipline, 


Delays Greater than t, Constant Service Times 


Although the probability of suffering a delay at all, POO), is 
relatively insensitive to the service time distribution,* the probabili- 
ties of delays POt) when t>0, are highly dependent on the character of 
the service time variations. The only case which is widely tabulated to 
date, in addition to that for the exponential service times considered 
in the previous section, is for constant service time with demands 
handled first-come-first-served, Even here the mathematical theory is 
surprisingly difficult and does not lend itself to constructing a simple 
pair of charts (such as Figs 2 and 3), covering a wide field of loads 
and servers, (For constant service times ami rariom order of service, a 
reasonably computable formula remains yet to be developed.) Crounelin { 3] 
first published a set of charts for selected mumbers of servers to c = 20, 
and occupancies to 0.8. Two of these, for c =1, 2 servers, with higher 
occupancies added, are shown in Figs. 5 and 6. 


e 2: Telephone ineering for Dial Tone ice. A relay-type 
controller serving 2000 lines in a certain automatic telephone 
system requires 0,5 second to connect dial tone, If each line 
originates an average of two calls per busy hour, what per cent of 
the calls will have to wit more than 1, 2 and 3 seconds for dial 
tone -with queued service? 


Solution: The occupancy of the single server (controller) is 

@ = (2000)(2)(0.5 sec.)/3600 secs. = 0.556. Since with no conges- 
tion, the controller requires 0.5 second to give dial tone, the 
delays of interest are 0,5, 1.5 ami 2.5 seconds. Expressed in terms 
of average service time, they become 1, 3 am 5 units of delay, 
respectively, Reading on Fig. 5 (constant service time for c =1 
server), the probabilities shown in the last column below are found. 


*P(>O) values for constant service times are very slightly lower than the 
values given for exponential service times on Fig. 2. A chart_showing 
the true constant holding time values is given by Pollaczek (4). For 
most engineering purposes the difference is negligible. 
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D Ton Corresponding Delays Prob. of Exceeding 
Seconds In Seconds In Service Times Dial Tone of Col. 1 


1 0.5 1 0.22 
2 1.5 3 0.026 
3 2.5 5 0,003 





Average Delays 


In many situations, principal interest is in the average length of 
delay. As noted previously, this is likely to occur when the cumlation 
of delays represents lost time or expense to an individual or company, as 
distinguished from the annoyance felt when some particularly long delay 
is encountered. It may readily be seen that the order of service from 
the queue is inconsequential in determining average delay, as long as 
there is no tendency to favor either long or short service time demands 
in the service order. The distribution of service times, however, has a 
considerable influence on the average delay, as we shall now see. 


For CORTES service times the average delay - derived as 
equation (18) in the Appendix - is 


te = POO) n/(c-a), (5) 


while for constant service times it is (3) 


co co 
Ten dewl|y GW _ 2 F Low o 
c v! a v! ) 
w=] vV=we ° v=wetl 


These two expressions are represented on Figs. 7 and 8, respectively. 


Example 3: Optimum Sige for Oil Truck Filling Station. The owner of a 


fleet of oil delivery trucks wishes to estimate how much delivery 
time his trucks are losing due to waiting at his filling station, 
and how much this would be reduced if he installed additional fill- 
ing units. Suppose he now has 28 trucks and four filling pumps, 
and it requires 20 minutes to fill a truck. If each truck needs 
three fillings on the average per Shour day, what is the average 
truck time lost waiting at the filling station? How much waiting 
time would be saved if he installed two or three more pumps? 


Solution: The “load on the filling pump system is 


eo zz 20 minse . 
0 = BAIS cine” 3050 ordangs. 


With four pumps the occupancy is © = 3,50/4 = 0.875. 

Enter the constant holding time average delay curves, Fig. 8, with 
c = 4 ami a = 0.875. Read that average delay equals 0.75 holding 
time = 0.75 (20) = 15 mimtes. For the fleet as a whole then the 
daily lost trucking time is 15 x 28 x 3 = 1260 truck minutes, or 

21 truck-hours. Thus due to inadequate filling facilities the time 
of nearly three trucks, on the average, is being wasted. 
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FIG. 7 - AVERAGE DELAY ON ALL DEMANDS, WITH EXPONENTIAL 
SERVICE TIMES. 
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FIG. 8 - AVERAGE DELAY ON ALL DEMANDS, WITH CONSTANT 
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The following table shows the improvement in lost truck time if 
r additional filling pumps are installed: 





Added Pumps Total Pumps Pump Avg. Delay Daily Lost Time 





r c+r Occupancy te (truck hrs) 

8) 4 0.875 15.0 mins, 21 hrs. 

1 5 0.700 2.7 mins. 3 hrs. 47 mins. 
2 6 0.583 0.78 mins. 1 hr. 5 mins. 
3 7 0.500 0.27 mins, 23 mins. 





Two small corrections might well be applied to this exemple. The 
actual delays will not be quite as great as those predicted here 
since infinite source theory has been used to solve a system with 
only 28 sources. A correction should also probably be made to allow 
for slightly improved delivery schedules as the number of pumps is 
increased, and with correspondingly increased demands by each truck 
for refilling at the pumps. 


It seems likely that the oil distributor would conclude that he 


should install two, and possibly three more pumps (business might 
increase). 


Length of Queue: 


For exponential service times the probability of a queue of length 
q or longer, is shown in the Appendix (equation 14) to be 


Qtzq) = (&)* Poo) (7) 


while the average queue length is q = POO) ois. Moreover the average 
delay on demands delayed, t, and the average queue length during periods 
of all servers busy, q, are related by 


= = + 1 
te “a. (8) 
in which tg = tg/PO0) and § = 7/P0). 
Equation (8) is satisfactory for exponential service times only, For 


constant service times, average queue length may be estimated from the 
approximate relationship 





a 
to = Th + “E>! (9) 
or its equivalent 
«oni. ee 
bs Fe c h etl (10) 


where to/h is read from Fig. 8, and P,~(0) may be approximated by the 
exponential service time values, POO), of Fig. 2. 
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3 ctio Test Facilit - Ina certain 


factory, complex machine assemblies come off several production 
lines at irregular intervals, averaging 42 per hour, am are then 
sent to one of eight test positions for acceptance or rejection, On 
the average it requires 10 minutes to test a machine although due to 
their intricacy and the adjustment possibly required, this is a 
highly variable time, found to be roughly exponential. 




















(a) If machines are stored on dollies awaiting test, how many 
dollies should be provided so no oftener than once in 500 cases 
will there by no place to park a completed machine? 





(b) All production is required to be passed through test at the end 
of the day. After 5 P.M, when production stops, the cost of 
testing increases by $15 per waiting machine. How much could be 
spent per day toward the purchase ami operation of an additional 
test position? 


Solution: Assume raniom arrival of demands, and exponential service 
times; then n = 42, c = 8, h = 10 minutes, so that a = nh/60 = 7.0 
ime, apda = a/c = 0,875. From Fig. 2, POO) = 0.54; by equa- 
tion 5), tg = h/(c-a) = 10 mimtes, 





a 
(a) Set 0,002 = Q(aq) = (@) POO) = (0.875)9(0,64). Solving for q 
gives q = 43.2; hence 44 dollies should be provided. 


(b) At 5 P.M. (as at any other time during the day) the expected 
mumber waiting in the queue is 


= POOdgas = 0.64 why = 4.h8 
If another test position were installed (c = 9), then 
at = = A = 
q 0.38 9-7 1.33 


The average daily savings in overtime testing would be (4.48 - 
1.33) 15 = $47.25 which could be spent toward an additional test 
position and operator. Note also that the number of dollies re- 
quired in (a) wuld be greatly reduced (to 21). 


Swmary 


The elements of simple queueing problems have been displayed ani 
some formulas given by which probabilities of delays and expected delays 
may be calculated. Working curves are shown ty which many common queue- 
ing problems can be either exactly or approximately solved. Examples 
are given which illustrate the use of the curves in practical situations. 
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APPENDIX 





Derivation of Formulas for 
t Ho. T 


Let c = number cf servers 
a = rate of demand arrivals per average service time 
t = time of delay 
h = average of exponentially distributed service times 
f(x) = probability of x simltaneous demanis at any random instant 


Assume: 
1. Number of sources is very large compared with servers so 
arrival rate is independent of the number of sources being 
served or waiting. 


2. Demands which are not served immediately, wait ami are served 
in order of their arrival. 


If a<c, the system willattain stable equilibrium, and the equations 
relating the state probabilities are: 


For O<x<e, (atx) f(x) = af(x-1) + Getl) t(xrl), or f, (x) -= t(0) 








(11) 
x 
For xc, (ate) f(x) = af(x-1) + cf(xtl), or f(x) = —*— £(0) 
™ cXCe} 
c~l co 
Since Z f(x) + fo(x) = 1,* 
x=0 x=c 
—a 
t(0) = 2 = : (12) 
. a* . > aX c=l axe"4a am ace~& , 2 
x=0 x! x=c or-Ce! x=0 x! c} c-a 


*If at some queue length b, customers refuse to join the waiting line 
(i,e., balk), the upper limit of x will be c+b instead of infinity. As 


a result A, wherever it appears in equations (12) and (13). will be 
modified by the factor por} The probability of balking will, of 
course, be simply f5(ctb). 
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Probability of Del 


The probability that any randomly chosen demand will find all 
servers busy amd be delayed (more than zero time) is 


co co x 
POO) = £ f(x) =2 —S— r(0) 
x=™c x=™c c c! 


ate“ _¢ 

ci oc=a 
- ate"* . a®o™ cc 
x=0 x! ci cea 





(13) 





Note that all terms except Ta are either Poisson individual or summation 


terms, so that evaluation of P(O>0) is easily accomplished. Values of 
PO) are read from Fig. 2.* 


Queue Length, Q(%q) and 9: 
The probability of finding a queue of length q, or longer, is 


co co x 
Qbq) = Ff folx)= 2 —&— £(0) 
x=qte xeqte cS! 


oo 
= &)7 £ f(x) = &)? PO) (14) 
x=c 
The average queue length is 


oo c co x 
q= £ (x-e)f,(x) = £(0)S 2 (x-c)() 





x=c ct x=c 
= r(0) © (@)° —B = Poo& (15) 
ef ‘ec a8)? cma ? 
-2 


The average queue length when all c servers are busy is, then, 


*Tables of P(>O) under the expression E5,,,(A) are given in Table III of 
MOE'S PRINCIPLE, A. Jensen, Copenhagen Telephone Company, 1950 for 
ranges of offered loads A = .1(.1)10, 10 (1)50, 52(4)100, and E = 0,0001 
to approximately 0,6000, 
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Average Delay, tg: 





When there is a delay, the delay equals the wait for (number in 
queue + ]) demands to be served, During periods of all-servers~busy, 
calls are served at a rate of c per average service time h, Then the 
average delay for calls delayed is 


= S+1_ ration 
tg 2 8s - Sh (17) 


and the average delay on all calls is 
-_- os h 
tg = POO) tg = POO) ce (18) 


Distribution of Dela POt): 


The probability of a random demand finding m calls in the queue is 
fo(x = mtc), Then the demand must wait for mt] calls to drop out of the 
system,. When all-servers-are-busy this occurs at a rate of c per aver- 
age service time h; or ct/h per time t. The probability of the demand 
having to wait longer than time t, equals the probability that fewer than 
m+] will drop out in time t, or 


r-ct/h 


r! 





paOt) = z (ct/n) 
r 


The total probability of a delay >t results from summing for all m: 





POt) = z fo(mtc) pat) = $ £(0) Pe t (ct/n)?_enct/h 
m=O m=O cc! r=0 r! 


. £(oy <= en (c-a)t/h = p60) e~(cma)t/h (19) 


And the probability of a delayed call being delayed >t is 


Fo(u) = e~(cna)t/h (20) 


where u = ct/h. Values of Fo(u) are read from Fig. 3. 
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DEVELOPMENT OF QUALITY ATTITUDE AND PRIDE OF WORKMANSHIP 


Harry E. Sagen 
Abbott Laboratories 


Quality attitude, quality mindedness, and quality consciousness 
are used synonymously, but quality attitude implies something more than 
just being aware of quality. 


Even before the early industrial revolution, craft guilds rose and 
fell on the quality of workmanship. Pride of workmanship with quality 
attitude manifested through the artisans product was the keynote of the 
day. Throughout the years, the successful craftsman must have con- 
stantly strived for quality by the very reason for his continued 
existence. 


In those days, the Acceptance Quality Level was truly determined 
by the customer, while the Average Outgoing Level was uppermost in the 
mind of the skilled craftsman who passed on through his apprentices, 
the skills and knowledge he had attained. Through all of this train- 
ing, there was continuous emphasis upon quality with the development 
and maintenance of quality attitude and pride of workmanship. 


Improving and maintaining quality in the process of reducing cost 
and price in the field of fair competition plays an important role that 
must be a part of the daily work habit of all employees, from the top 
management of the organization down through the ranks to, and including, 
those who do the most menial t. sks which are classed as non=skilled. 


You have undoubtedly attended a large fair or livestock exposition 
and watched the judging of animals which is a form of inspection, and 
also have watched the exhibition of showmanship by both 4H Club Groups 
and seasoned adult exhibitors. The exhibitor as well as the judges can 
only measure quality through attributes that can be observed by sight 
and feel. The exhibitors are mindful of this, and use all their ethical 
methods possible to emphasize these attributes of quality and use their 
cunning showmanship to accentuate the attributes associated with quality. 
In all the preparation of feeding, grooming and showing, the exhibitor 
has an ultimate goal, the gr: championship with its accompanying joy 
in the pride of accomplishment. dave you ever seen anything more 
gratifying than the smile of priae on the face of a winner? My only 
reason for bringing in the foregoing example is, because all of you 
have at some time or other experienced the same joy in pride of work- 
manship which culminated because of a quality attitude held during the 
period in which you accomplished your goal. 


To bring about the development of "Quality Attitude and Pride of 
Workmanship" calls for the use of all the principles, knowledge and 
techniques in the teaching and training of employees, and all the 
principles embodied in good human relations in the handling of 
employees. 


To develop a quality attitude and a pride of good workmanship calls 
for an intelligent program, put on by personnel having an understanding 
of human behavior. Alexander Pope, in his essay "Essay on Criticism" 
gives us an excellent guide in his statement, "Men should be taught as 
if you taught them not, and things unknown be proposed as things 
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forgot." This can be paraphrased in many ways, but in a very humble way 
tells us to forget about the personal pronoun "I," and to encourage 
doing on the part of others by sharing and even giving up some of our 
coveted recognition. The true recognition will eventually find the 
deserving person who has willfully dropped the use of the personal pro- 
noun "I," 


The techniques and types of programs best suited to develop a 
quality attitude and pride in workmanship will depend in part upon the 
size and character of the organization and industry, the size and 
character of the working sub-group, and the range in characteristics of 
the individuals. 


Our greatest aim must be to restore to the individuals, the pride 
of workmanship they had before the days of mass production. Through the 
years of our industrial history, we have developed the idea that we can 
influence persons in large groups if we have the right program and 
literature. This is in part true, but if we lose sight of the individ- 
ual and personal basis, we lose the effect of the binding kingpin, the 
individual who can unintentionally make or break the program to be 
developed. To really influence large groups in the development of 
quality attitude, one must deal with the individual on a personal basis. 
Any program that is to be used to influence workers should be geared to 
take in the individual. 


In the Training Within Industry Service, developed during World War 
II, the Job Relations Training emphasized the fact that supervisors get 
results through people. The foundations for good job relations were 
listed as; (1) let each worker know how he is getting along, (2) give 
credit when due, (3) tell people in advance what changes will affect 
them, and (4) make best use of each persons ability. To build best upon 
this foundation, people must be treated as individuals. 


Now then, what can we do if we are to treat the workers as 
individuals in a program of developing quality attitude and pride of 
workmanship? Suppose we assume that top management does not have to be 
sold on the value of a good quality control program in which job 
relations play such an important part. If this is true, the efforts can 
be expended in developing an attitude among the line staff and workers, 
whether they be in plant or in office operations. 


The personnel you have on hand would not be exchanged except, 
insofar as to make the best use of each persons ability. However, the 
personnel responsible for the selection and employment of people should 
through their interviewing and pre-employment testing in aptitudes and 
attitudes, evaluate the prospective employee in terms of job and quality 
attitude. Many employees are classified as accident risks. In a like 
manner, workers tend to classify each other on the basis of attitude and 
workmanship risks. If this information can be obtained from periods of 
previous employment of the new worker, it will be of material help to 
the supervisors during the training and orientation periods. 


The training and orientation program should emphasize development 
of a quality attitude and pride of workmanship with the same emphasis 
that safety, reduction in operating cost, employee health, and other 
points are now stressed as factors in the over-all picture of plant 
operations and handling of people. 
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Quality attitude and pride of workmanship are developed through 
accomplishment, primarily through proper handling of people. I am 
reminded of the youngster in the third grade in school who had just 
completed some written homework for the following days class. It was 
done in a very slip-shod fashion, with erasures and crossing out of 
words, which the parent suggested warranted re-copying to make a paper 
worthy of being handed to the teacher. The youngster, however, 
questioned the value of this effort and made the statement, "Why should 
he care how the paper looked as long as the teacher accepted that kind 
of work." Did not the statement of the youngster perhaps reflect the 
real attitude and lack of pride in workmanship on the part of the 
teacher? Contrast this example with another class from the same school 
a few years later under the influence of teachers who developed a feel- 
ing of pride in accomplishment among the pupils. This group gained an 
enviable reputation for neatness, completing of assignments, and 
accuracy through the setting of high standards of quality, and which 
when attained gave the pupils a pride of workmanship. Have we not seen 
examples similar to these in industry and everyday life? There is no 
difference in developing attitudes in industry from that in the class- 
room. The incentives are not always the same, but the principles of 
handling the individuals are the same. There is an inherent dignity 
among individuals, in which man's instinct to be a part of a group in 
which he can take pride in accomplishment is a factor to be recognized. 


The worker takes pride in belonging to an organization where the 
day-to-day actions of top management stimulate confidence and respect 
for the sincerity of their actions. This is true in regards to the 
day-to-day actions which reflect quality attitude, as well as those 
reflecting other attributes which are characteristic of a strong, live, 
organization. 


The individual on the assembly line takes pride in his work and 
company when he knows the product he is putting out represents the best 
efforts and knowledge of research, development, engineering, manu- 
facturing, control, purchasing, advertising, and sales. Effective 
employee relations are maintained with the minimum effort when the 
individual feels the pride of belonging; of belonging to an organization 
which has earned the right of loyalty to the company, devotion to the 
supervisors, and a pride of workmanship, because of the policies and 
actions which have gained the respect and confidence of the individuals. 


Perhaps we in the Pharmaceutical Industry are fortunate, and have 
an enviable position in the effort of developing quality attitude and 
pride of workmanship among the workers. This is because the management 
and workers alike, are mindful of the moral and ethical obligation to 
society through their products, in addition to operating under the legal 
standards set by the Food, Drug and Cosmetic Act, as well as specific 
requirements through N.I.H., A.R.S., U.S.P., and N.F. The Pharma- 
ceutical Industry is alert to its responsibilities and obligations to 
the public and medical profession. Through its various professional and 
industrial organizations, it is constantly keeping quality, its improve- 
ment and control, ever uppermost in mind during their deliberations. 
Through committees such as the U.S.P., Contact Committee, various 
committees in the A.D.M.A., A. Ph. M.A., etc., it has taken the 
initiative. Standards of quality set through regulations have become 
minimum requirements, just the same as in a good safety program the 
safety code becomes the minimum standard. The improvement above minimum 
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standards is up to the individual managements. 


A U.S.P. Sub-committee (4-13-49) in stating the general 
principles of Quality Control in the Pharmaceutical Industry said, 
"Pharmaceutical Quality Control is the operation of a system of checks 
and safeguards set up in a plant to assure integrity, strength, quality 
and purity of the finished products. The purpose is to assure that all 
merchandise shall meet professional requirements, all legal standards, 
and in addition, such standards of refinement and elegance as the 
management of a firm may adopt. It is the sum and substance of all 
effort which differentiates the manufacture of drugs and medicines from 
most other manufacturing projects, and makes of it an altruiatic pro- 
fession on a par with the practice of medicine." 


In any discussion of Quality Attitude and Quality Control, it must 
be remembered that the turning out of a quality product is not governed 
entirely by gauges, analytical balances, control charts, etc. A quality 
product is not the result of the manufacturing process alone. In this I 
am sure you will agree. You will agree that a fine pharmaceutical 
product could not be turned out, even with control chart methods, under 
unsanitary working conditions. An intricate instrument could not be 
properly assembled in a poorly lighted factory. Quality attitude must 
be evident in every phase of a business, from the production of the raw 
materials right down to the time the final product is used by the con- 
sumer. 


We are all familiar with the statement that quality cannot be 
inspected into a product. It must be built into the product. 


To build a quality product we are dependent upon management, men, 
materials, methods and machines. In dealing with the five "M's" the 
individual remains as the keystone to be fitted into the structure 
which is asked to support and produce the quality product to satisfy 
the boss, the final customer. It is no wonder that today the efforts 
expended to have an educated and enlightened individual as a worker are 
increasing. 


Much has been said and written about the American Economic System 
and the American Way of Life. Bringing the facts about this great 
heritage to the supervisory personnel, in order that they may better 
know the underlying principles of cause and effect, will do much to 
stimulate pride in our American Way of Living and pride in our 
accomplishments as a people of the North American continent. 


There is an alertness on the part of the management as to the 
importance of all phases of operations of any company, but it is not 
necessarily true that this is shared by all the line staff. Likewise, 
the knowledge of the relation and part played by materials produced or 
services rendered by the individual is not shared by the workers in the 
production of a product. 


The lack of knowledge of the importance of the individuals's work 
is often responsible for the various attitudes some not conducive to 
quality products. 


It is utterly impossible to have every person know the functions of 
all the other people in the various departments. Providing an 
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opportunity for those who are interested in hearing about how the work 
of other departments affect their operations, is a means of stimulating 
work interest. Through this work interest a feeling of pride of belong- 
ing is nurtured, from which pride of workmanship and quality attitude 
can be developed. 


In the development or production of a pharmaceutical product, many 
departments are involved. The ultimate consumer acceptance of the 
product, based upon quality as manifested in its various attributes, 
must be their goal. This can be attained when Research, Development, 
Engineering, Purchasing, Production and Control think and act with a 
Quality Attitude. A breakdown of this attitude by any one group will 
jeopardize the contributions of the other groups in the production of a 
quality product. Dosteoyevski once said, "If it were desired to reduce 
aman to nothing it would be required only to give his work the 
character of uselessness." Conferences which are democratic; not 
dictatorial, autocratic, or sectarian, bring about the discussions 
which lead to quality attitude and a quality product. 


In building a quality attitude and pride of workmanship, there is 
one group that is often neglected in recognition of its contribution to 
quality products. This group is none other than the maintenance work 
force. Basically they have the responsibility of maintaining equipment 
in condition for the maximum production of high quality material. Fore- 
men are prone to give a time limit for a change-over or repair, which in 
many cases results in poor quality maintenance, not as a choice on the 
part of the maintenance worker and set-up man. The maintenance worker 
is one who should be tactfully handled to develop a feeling of pride of 
workmanship based upon accomplishment. One must remember that he deals 
with people under the stress of quantity production. One minute he is 
dealing with an eager beaver who expects to become plant superintendent 
over night, and the next minute he deals with an old employee who never 
believes in change. There are few others in the organization who are so 
closely associated with cause and effect in production of quality goods. 
Look well to his suggestions of help for improvement. 


The effect on quality and quantity production is very evident when 
machines fail or fall into dis-repair. Quality Control can help prevent 
the maintenance man from falling into dis-repair with respect to his 
quality attitude and pride of workmanship, through its recording of 
machine performance for future evaluation. 


Just as machines perform better when the rated work load is not 
exceeded, quality production is likewise a greater probability when the 
human element is not overloaded because of faulty planning and schedul- 
ing. Pressure exerted because of extreme necessity for quantity pro- 
duction will usually be followed with increased cost of production due 
to increase of rejections. Lack of normal work load is also to be 
avoided. Busy hands and busy minds are essentials in development of 
quality attitude and a pride in workmanship. 


Perhaps we should stop at this point and ask ourselves, what is an 
attitude? Finlay, Sartain and Tate in their book entitled "Human 
Behavior in Industry" state, "an attitude is a tendency to act with 
respect to a certain value." It is an inclination we have for a thing, 
person, or idea. 











We cannot always judge an attitude by how a person acts because 
sometimes the true attitude is suppressed. Attitudes are under the 
surface and latent; possessing potential energies that can be directed 
to positive value under the guidance of a basic understanding of human 
nature. Some workers whom we consider non-conformists in quality 
attitudes today, are that way simply because they have not as yet 
attached a positive value to this concept. It is up to us in manage- 
ment and supervision to help the individual acquire the positive value 
through contact with our experiences. An attitude is not hereditary. 
It is acquired through contacts. 


If attitudes are acquired, can they be changed? Attitudes can be 
changed through the change in value system, because as the values change 
our attitudes change. 


Many positive values as quality attitude, have been determined by 
intellectual reasoning. The positive value attached to a quality pro- 
duct is something that the worker can grasp as a logical conclusion 
through directed reasoning. He knows there is a calculated risk in 
skating on thin rubber-ice on the pond. He can reason that proper 
lights are essential for night driving. 


Intellectual values are the easiest to change. Emotional attitudes 
are more difficult to change because with emotions it is how we feel 
about a thing rather than how we think about it. The workers best 
intellectual processes may tell him he is wrong, but he may still stick 
to his feeling about a thing. 


Development of quality attitude and pride in quality should be 
acquired through the intellectual process. Pride in quality of a pro- 
duct is a value to management as well as to workers on the line. The 
executives like to feel they are associated with the best product in 
their field. If this feeling and attitude toward quality is contagious, 
it is reflected in the work habits of the line staffs down through the 
ranks. The quality minded executive will not compromise quality because 
he has an attitude toward the value of the product. To the short- 
sighted, this may appear as a reduction in profit, but to those who see 
beyond the first horizon, there is the picture of harvest yet to be 
reaped. 


Those who work with people should know that this thing we call 
attitude is capable of changing. It becomes our duty to lay out the 
course. People react emotionally. Attitude like morale is difficult to 
create within a group, but sometimes so easy to lose because we forget 
the group is made up of individuals differing from one another in 
experiences which make up the values that can spell success or failure 
in quality production. 


In the development of morale, "the feeling of belonging," the 
individual feels the welfare of the group is more important than his 
own. If we can team up morale with pride of workmanship, a combination 
would be formed that would be hard to lick. To maintain this combina- 
tion day after day, calls for an intelligent supervisory staff backed by 
a@ management cognizant of developing "the feeling of belonging." 


Each one of you can perhaps recall instances in which a supposedly 
weaker team made up of younger unknown players, who had never made 
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success, captured the championship at the end of the season from the 
better known stars. Constant hustle and a will to win were the 
characteristics attached to the winners. This is an example of morale, 
a feeling of belonging; belonging to a group with a definite attitude, 
and a desire to act toward certain values. If, in our workers, this 
value is quality attitude and pride of workmanship, the morale will be 
high because the desire to win in this economic competition through 
constant hustle and alertness with intellectual reasoning will be an 
ever present incentive. 


You perhaps are disappointed, that in this discussion no definite 
program has been advanced for obtaining quality sctitude. Perhaps this 
is more intentional than accidental, because the program devised should 
be geared to the particular organization and its individuals. 


In the beginning of my discussion, attention was focused upon the 
individual. An intelligent and enlightened individual with enthusiasm 
for his work makes the best raw material for development of quality 
attitude. 


At Abbott Laboratories, employees at different levels have availed 
themselves of the opportunity to learn about Quality Control and 
Statistical Methodology, as an adjunct to Research, Production and 
Control. 


All the effort to develop quality attitude has not been expended 
in formal class training of a few. The success of Safety Training in 
industry has been due in large part to the constant publicity through 
posters, meetings, suggestion systems, special drives, etc. Quality 
attitude and pride of workmanship can be built soundly through the same 
methods. Recently, in our plant, following a very splendid two-day 
rally and demonstration of safety methods and equipment, the Production 
Quality Control Group placed "Push" and "Pull" signs upon various doors, 
propagandizing the "push" and "pull" for better quality and reduced 
cost. This idea borrowed from another company, met with much favorable 
comment because it had the effect of focusing attention upon quality and 
reduced cost to the same extent to which safety has been accustomed. 


In the beginning of this paper, mention was made of the importance 
of working through individuals. People are to be handled differently 
from machines, which after reading the instruction for operation can be 
handled very much alike. With human beings one must study, adjust, and 
set the examples by doing as you would have them do to develop the 
enthusiasm for your program. 


A program with an objective, clearly in mind, is the start. No use 
running until you know where you are going. This perhaps can be 
illustrated by telling the story about the boys attempting to make 
straight paths across a field of freshly fallen snow. One after another 
the boys stepped carefully along, placing one foot in front of the other, 
but with no great success. Another boy came along and with great con- 
fidence said he could make a better and straighter path than had been 
done. He set out at a fast, unhesitating pace, and when his path was 
completed, it was found to be straighter than that of the others. 
Pressed for the secret of his success, he replied, "Do you see that oak 
tree over on the hill? I just kept my eyes right on that and walked 
toward it without paying attention to the snow or anything around me." 
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This boy knew in what direction he was going. We, too, should know 
where we want to go. 


Let me close this paper by again quoting from the Essay on 
Criticism, "But where's the man who counsel can bestow, Still pleased to 
teach, and yet not proud to know." 





ELECTROPLATING APPLICATIONS OF QUALITY CONTROL 


W. E. Jones 
Management Controls 
Des Plaines, Illinois 


Electroplating is a word that has not been discussed in the Quality 
Control literature. A search through Industrial Quality Control revealed 
no articles on this subject. A search through 1956 and 1957 Abstracts 
of Statistical Quality Control was also fruitless. W. C. Geissman and 
R. A. Carlson have presented material on “Current Distribution in Barrel 
Plating. A Statistical Study", Proceedings, American Electroplater's 
Society, 1952. Material presented here has been taken in part from 
work done with Geissman & Oarlson at the National Lock Company, Rockford, 
Ill. Other than sinc barrel plating case studies are from unpublished 
material from various companies. 


The first six charts are limited to the particulgr field of sinc 
barrel plating. Additional charts touch on cadmium barrel plating. 
Comments will include case histories in the fields of galvanizing con- 
tinuous strands of wire, copper plating, chrome plating, and nickel 
plating involving all types of plating equipment. Contents of this paper 
are designed to stimulate further use of a new approach to the economic 
control of electrochemical plating processes. This paper is not designed 
to present a complete story on any single process. Much of the material 
presented on zinc barrel plating is restricted to only one barrel size 
at one revolution speed in one standard sinc solution. 


Barrel plating has often been regarded as a process that is very 
uniform since all of the parts are tumbled which randomizes the exposure 
of every part to solutions and current densities. It is almost a univer- 
sally accepted illusion thet barrel plating produces absolutely uniform 
plating thickness on every part. Statistical Quality Control methods are 
relatively new to electroplaters. Statistical analysis is the most effec- 
tive method for investigation of the many variables affecting electro- 
plating quality. 


Zinc barrel plating results were obtained from a fifteen barre unit, 
consisting of three tanks with capacity of 3 barrels each. Barrels were 
horizontal, hexagonal, 30 inches long, 14 inches across flats, with flex- 
ible dangler contacts, 2lmm perforations in Lucite panels, all made by 
same manufacturer, and all rotated at six revolutions per minute. Stand- 
ard procedures were used for cleaning and pickling parts. Power supply 
was a 4,000 ampere, 12 volt generator for each tank. Plating solution 
formula was: sinc cyanide 10 o2/gal, total sodium cyanide 16.5 o2z/gal, 
and caustic soda 14.5 oz/gal. Sodium sulfide was added to reduce metallic 
impurities. An organic brightner, RH-309, supplied by the Du Pont Company 
was also added to maintain brightness. Solution temperature varied from 
70 to 90 degrees F. Plating was followed by two rinses but no bright dip. 
Zinc barrel plated parts were a large variety of wood, machine, and sheet 
metal screws. Plate thickness was determined by drop test method, Hull- 
Strausser ammonium nitrate, using an automatic dropping unit made by the 
Kocour Company. This instrument delivers the standard 100 drops per 
minute. Identical areas on different pieces were always measured. 
screws and bolts, heads were always measured. 


Five groups of samples were generally taken from each barrel, spaced 
evenly along the axis of the barrel. This stratified method of sampling 
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permitted an analysis from end to center to end of barrel. There was no 
significant differences in the averages or the variances between groups 
of samples by location along the axis of the barrel. This permitted 
substantial reduction in sample sizes taken from each barrel. At the 
start of the study, 60 samples were taken from each barrel. This was 
later reduced to 25 samples from each barrel. 


Chart 1 shows the barrel average plate thickness plotted against the 
total wariation within each barrel (6 standard deviation value for variance 
within barrel). This proves that the variation within each barrel in- 
creases significantly as the average plating thickness increases. The 
line of best fit defines the relationship between these two variables. 
These results are only for the previously mentioned formula solution. 
Discussion of these results suggest further investigations on the effect 
of reducing the sinc cyanide progressively down from 10 oz/gal to as low 
as 3 o2/gal, keeping other elements of the solution about constant. With 
caustic soda and total sodium cyanide at about 24 to 3 times the sinc 
metal, it has been claimed that it is possible to run barrel plating so 
uniform that it is almost impossible to detect variation on the parts 
within a barrel. I have not found any published facts showing effect 
of such departures from "standard" solution formulations. It is also 
suggested that barrel revolution speeds be changed. Better uniformity 
has also been claimed at 3 revolutions per minute in 20 minutes instead 
of 6 revolutions in 30 mimites. 


Chart 2 shows no correlation between surface area in barrel load and 
total variation within each barrel. Chart 3 shows very little correla- 
tion between plating time and total variation within each barrel. Chart 
4 shows slightly more correlation between ratio of time/area and total 
variation within each barrel. 


Chart 5 is a simplified presentation showing minimum plate thickness 
assured in relation to average plate thickness. If a minimum specifica- 
tion of .1 mil must be met, it is necessary to plate an average of .22 
mils in order to guarantee that 99.87% of all the loads will have less 
than .1% under specification. To guarantee a minimum of .2 mils requires 
an average of .37 mils which is almost 90% more zinc than would be re- 
quired if all parts could be plated right down at the minimum. Most zinc 
barrel plating processes will range from 90% up to as high as 200% thick- 
ness above the minimum of .2 mils required. Production personnel were 
able to use this chart to reduce the average amount of excess zinc. At 
the same time, they obtained more uniformly plated parts for shipment. 

An Operations Research study is suggested at this point for consideration 
of all the economic factors involved in making the decisions necessary 

to reach optimum profit from the zinc barrel plating process. Any change 
in the process that reduces the within barrel variation immediately per- 
mits a reduction in the average amount of zinc plating needed to guarantee 
a minimum. These savings must be weighed against the costs of changing 
the process. 


Chart 6 shows the correlation between ratio of time/area to barrel 
average zinc plate thickness. Chart 7 shows the correlation between 
amperage and average cadmium plate thickness. These two charts are typi- 
cal of many simple comparisons that can be made to evaluate relationships 
existing on actual production equipment. Advanced multiple correlation 
and design of experiments are considerably more efficient techniques than 
the linear correlation studies graphically shown in this paper. These 
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advanced methods are essential to reduce the cost of analytical investi- 
gations where so many variables are involved. 


Charts 9 and 10 show an ordinary average and range control chart 
study of the strength of cadmium solution. One analysis is made weekly. 
Two successive weeks are used as a sample size of two to determine the 
average and range. This chart is typical of the improvements obtained 
on all solution characteristics charted: namely; copper oz/gal; sodium 
cyanide oz/gal; tin oz/gal; sodium hydroxide oz/gal; Ph; sodium carbonate 
o2z/gal; and cadmium os/gal. 


Conclusions 


Use of simple control charts and easily understood linear correlation 
analysis will produce excellent improvements due to their psychological 
impact. Use of advanced statistical quality control methods is essential 
to avoid the high cost of investigation. Average zinc plate thickness 
is correlated to an increase in the variation of zinc plate thickness 
within each barrel, for the solutions and manufacturing conditions de- 
scribed in this paper. New formulations will be developed and new ideas 
will find their impact on equipment that has not changed for many years. 
Statistical inve.tigations in the electroplating field will result in 
"better quality at lower costs". 
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QUALITY STANDARDS IN SHOE MANUFACTURE 
John T, Heald, Vice President and Factory Manager 


When this country was first settled, there were amogst those 
earliest colonists shoemakers whose task it was to provide footwear for 
their communities. These men were craftsmen of the first order, and 
they accomplished their skills with little more than a few hand tools. 


Today, more than 300 years later, we find the descendant of the 
original hand shoemaker still depending on his hands, eyes, and goad 
judgment when he is called upon to make fine shoes for men and women. 
Whereas cne might suppose that a considerable portion of the machinery 
used in the manufacture of shoes would be automatic, we find this not to 
be the case, Because of the great variations in the basic material used 
in shoes, namely leather, and the additional wide variations in the 
styling and construction of footwear, the shoe industry finds itself a 
far cry from automatic process machinery. The individual operator deter- 
mines through his skill, in equal measure with the machine, the quality 
of the product. Thus, with so many present, it is of the utmost impor- 
tance that some sort of definable fence be built around these variables 
to prevent them from running wild on occasion. 


This becomes even more significant in the segment of the industry 
in which we, The Stetson Shoe Company, find ourselves competing - namely, 
high grade mens' shoes, retailing for about $25.00 per pair and up. It 
may be confidently and categorically stated concerning this area of the 
industry, be it mens' or womens' shoes, that the consumer expects a great 
deal of the product he purchases, as well he may. Were his standards of 
acceptance to be carefully analysed and segregated they would separate 
into the following: 


- durability; wearing qualities 
- comfort; fitting qualities 
- style; eye appeal 


To engender uniform and maximum customer acceptance and satisfaction 
has been the prime objective of our Quality Control Program since its 
inception three years ago. Again recognizing the tremendous variables 
present, which so often have been in the past responsible for variations 
in customer acceptance, we felt that the object of our efforts must be 
the attainment of Uniformity and Consistency of product. 





It might aid to shed further light on this problem to define the 
components of our product quality: 


(Chart A) 
In order to assure maintaining the optimum level of custamer 
acceptance in terms of these components it has been necessary to estab- 
lish the following steps in analysing the product, a newly styled line 


being developed every six months and involving some one hundred new 
models or styles: 


(Chart B) 
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In addition thereto constant vigilance is maintained over merchan- 
dise returned by customers for refund or repair. By this means another 
check is made of established standards designed to assure customer 
acceptance and satisfaction in the areas of material durability, foot 
comfort, and appealing design. 


It would be fair to state, in passing, that the average person ex- 
pects far more in terms of performance from his footwear than he does of 
any other item of apparel. Thus, the most exhaustive tests mst be ap- 
plied to the product to assure customer satisfaction; it is not at all 
unusual for customers to return shoes for dissatisfaction with one or 
more of the above quality canponents after months of obviously punishing 
wear, 


Therefore, in an industry where it has been custamary in the past 
for manufacturing personnel to carry quality standards about in their 
heads, permitting unhappy variations in the product to often occur be- 
tween the time of product creation and its actual delivery to the cus- 
tomer, it became necessary for us to define these standards in order that 
everyone in the organization might understand and be in complete agree- 
ment upon them. 


This has resulted in the development of a program which establishes 
standards for the following areas in the manufacture of the product: 


- purchased materials quality 
- process control standards 
- finished shoe quality 


Standards relating to purchased materials quality might be segre- 
gated into two groups: 


1. Leather quality requirements, which pertain to material used in 
the upper portion of the shoe primarily, and which accounts for 
better than 50% of the material cost of the shoe, and 


2. quality requirements for other materials, which include heels, 
welt (a basic ingredient of the goodyear welt construction, common 
to most mens' shoes) and other materials such as nails, threads, 
etc. 


As concerns the latter category, tolerances as to texture and 
measurements of incoming materials have been established, suppliers have 
boen advised of such, and samplings are made of each incoming lot. This 
has served to reduce considerably the reworking and repair of shoes in 
process which previously resulted from sub-standard materials. 


The development and application of standards for upper leather 
materials, the major expense item of the shoe, will require further study 
before optimum controls may be considered to be in effect. This is due 
to the extraordinary variations in the animal skin itself, as well as the 
variations which occur between the several tanners in the grading of 
these skins. It can be reported that based on the fact that grading of 
the leather determines its price, a careful study of the relative cutting 
economy of the various grades of each tanner has indicated clearly that 
considerable savings can be effected, on a predetermined plan of purchase, 
by purchasing specific grades of leather for specific patterns or styles. 
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Continued refinement of this study of quality standards in grades of 
leather will result in very sizeable economies. Results in this area 
have been unsatisfactory in the past because of the lack of a carefully 
devised control plan. 


Process control standards have been established with regard to 
critical operations, variations in which have in the past resulted in a 
finished product lacking in the desired uniformity, to the extent that a 
certain portion of these would be considered unacceptable and therefore 
rejected. Such in-process inspections or samplings are made daily, and 
unacceptable deviations reported to departmental foremen when detected, 
permitting corrective action to be taken @m the spot, so to speak. In 
addition, weekly reports indicating quality performance in these criti- 
cal areas are made to foremen and the plant manager each week, so that 
adverse trends in performance may be determined and the necessary action 
initiated, As such, the in-process checks provide an invaluable addi- 
tional tool to the foremen and plant management in maintaining uniformity 
of the product. 


Perhaps one of the most important areas over which standards have 
been established is that of Finished Shoe Quality. Herein a careful 
endeavor has been to define stancards in terms of what the qustomer con- 
siders desirable and undesirable in the product, avoiding assiduously 
the pitfall of viewing the finished shoe quality from the viewpoint of 
the manufacturer rather than that of the consumer. [Experience has in- 
dicated that an unintentimal gulf can sometimes exist between these 
viewpoints. Thus defects are- segregated as to severity, and are noted 
as either "major" or "minor", This differentiation has happily resulted 
in an appreciation that what may appear to the manufacturer as "major", 
may be construed by the customer to be "minor", and vice versa. The 
results of this audit, as is true of other samplings, are reported to 
manufacturing management, and in the process of taking necessary cor- 
rective action the customer viewpoint is properly balanced with and 
emphasized in relation to the manufacturers viewpoint. 


It should be mentimed at this point that the establishment of these 
processes and finished product standards has served effectively to per- 
mit optimum control over the most elusive element with which apparel and 
footwear manufacturers in general must constantly deal - that of "style" 
or the artistic quality of the product. Sales and Production management 
work hard and long towards the objective of developing a product which 
has "eye appeal", the particular quality which the manufacturing depart- 
ment has the most difficulty in constantly and uniformly interpreting. 
In a so-called high grade and expensive product, the customer rightfully 
expects, (consciously or unconsciamsly) this quality feature to be 
pleasing to his taste. 


Because of the unquestioned importance of finished shoe standards, 
only a slight deviation from the goal is permitted as acceptable, and 
results have indicated an ability to produce a finished product 98.6% 
free from major defects, 


Since the desired results of any such Quality Control Program ulti- 
mately depend upon the understanding and response of all persons involved 
in the manufacturing process, management and labor alike, every attempt 
is made to keep such people constantly advised concerning the status of 
quality performance. Conferences are held periodically with manufac- 


353 








turning supervision, at which time performance is reviewed and plans are 
made to undertake effective corrective action. Quality Control best 
serves its purpose whén the Foreman regards it as an effective tool and 
help in enabling himto best fulfill his quality responsibility. 


In addition, a constant attempt is made to remind production em- 
ployees of their responsibility in this area and the beneficial effect 
which proper observance of this responsibility has upon their job secur- 
ity. This phase of the Quality Control Department's activities, which 
can be designated as a quality-mindedness program, is effected through 
Quality posters, pay check messages, articles in a monthly newspaper, 
and employee plant inspection teams. The latter, which are camprised of 
selected employees from each department and their foreman, are briefed 
on the particular problems being encountered, and the significance 
thereof. They accompany the Quality Control Director on an inspection 
of the entire plant, and are asked to observe practices or conditions 
which may result in inferior quality performance. Over a period of time 
this serves to acquaint many employees with the relationship of their 
own tasks to those in other departments, and impresses them with the 
necessity of a high level of quality performance in the finished product. 


In summary, the establishment of and control over standards in Shoe 
Manufacture has for this company greatly improved our ability to deliver 
a consistent and uniform product to the retail customer and hence to the 
consumer, an accomplishment which should help to inspire confidence on 
their part in the product and company. 


It has also become evident that appreciable cost reduction may be 
realized through a Quality Control Program, despite the fact that the 
costs of maintaining quality have been historically low in relation to 
Sales Dollars. In addition to the savings resulting from reduction of 
in-process repairs and rejects, and customer returns, it is obvious that 
comparable economies may be enjoyed through purchase policies based on 
proven knowledge of the leather suppliers' grading methods. 


In one of the country's oldest industries, steeped in tradition, 
quality standards have brought to this canpany an invaluable new vitality 
and confidence, which should continue to pay dividends in today's can- 
petitive market. 


COMPONENTS OF PRODUCT QUALITY 


Stetson Shoe Company 




















Quality of Quality of 
Basic Quality of Shoes Design Conformance 
Pleasing Style X 
Fine Materials X 
Uniform Appearance x 
Comfortable Fit X X 
Long-wearing “ualities X X 

(Chart A) 
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oTEPS FOR QUALITY ANALYSIS: 


(Covering new products and selected old products) 


Examination of materials for durability, work- 
ability, strength, color fastness, odor and 
uniformity. 


Sewing tests on materials to determine optimum 
stitches per inch, together with evaluation of 
strength and appearance of sewed material. 


Tests of increased severity on assembled shoes 
to determine points of weakest strength or 
potential damage, 


Controlled wear testing of shoes with reports 
every 100 hours of use, om the basis of Form A. 


(Chart B) 
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SYNTHETIC SAMPLING 


A Way to Predict Circuit Reliability and to Automate Design 


by 


L. Hellerman and M. P. Racite 


ABSTRACT 


A new approach to the problem of designing reliability into an electronic 
circuit is the method of synthetic sampling. This method is superior to 
the commonly used "worst-case" design philosophy, which is character- 
ized by the requirement that the circuit operate reliably when the com- 
ponents are at specified adverse limits (Reference 1). We shall point 
up this superiority by considering several shortcomings of the worst- 
case philosophy, as follows: 


1. It makes no distinction between the probability of mean and 
extreme outputs. 


2. If a circuit can fail in more than one way, no distinction is 
made between the probabilities of failure from the different 
causes. 


3. It provides no measure of reliability. 


Synthetic sampling does not have these weaknesses. It autornatically 
weighs each output with its probability, and it gives the reliability of the 
circuit based on the actual component distributions. The method is il- 
lustrated in this report by application to a typical transistor switching 
circuit. 
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SYNTHETIC SAMPLING 
A Way to Predict Circuit Reliability and to Automate Design 


by L. Hellerman and M. P. Racite 


INTRODUCTION 


This paper presents an application of the method of synthetic sampling, 
or Monte Carlo, to the problems of reliable design and quality control 
of electronic circuits. 


We believe a design procedure using synthetic sampling is far superior 
to the commonly used "worst-case" design philosophy, which is char- 
acterized by the requirement that the circuit operate reliably when the 
components are at specified adverse limits (Reference 1). We shall 
point up this superiority by considering several shortcomings of the 
worst-case philosophy, as follows: 


1. It makes no distinction between the probability of mean and 
extreme outputs. In view of the nature of the output distribu- 
tion, this is often unrealistic, unduly pessimistic, and costly. 


2. Ifa circuit can fail in more than one way -- for example, 
from too high an output and too low an output -- no distinction 
is made between the probabilities of failure from the different 
causes. This unbalance in reliabilities, as an example will 
show, may actually make it desirable to design failures from 
one cause into a circuit purposely, in order to increase the 
overall reliability. 


3. It provides no measure of reliability. The specification of 
limit component values for the worst case is somewhat arbi- 
trary, and the designer never knows how much reliability he 
has bought by using one or another set of extreme component 
values. 


Synthetic sampling does not have these weaknesses. It automatically 
weighs each output with its probability, and gives the circuit output 
distribution and reliability based on the component distributions used. 
An automatic design procedure using synthetic sampling on the IBM 704 
will be described and applied to the design of a typical transistor switch- 
ing circuit. 


In the quality control of an electronic circuit a manufacturer may have 
the following problem. A certain lot of components to be used in the 
circuit is tested and found to be off specification. If these components 
are used anyway, will the effect on the quality of the finished circuits 
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be such that most will have to be discarded, or will the effect be mar- 
ginal, so that most of the circuits will be satisfactory? Here again, as 
we shall see, the method of synthetic sampling is an excellent way to 
find the answer, for all we need to know is the circuit output distribu- 
tion based on the off specification component distributions. 


NOTE: 


This paper was presented in part, at the Fourth National Symposium on 
Reliability and Quality Control in Electronics, in the session on "'Qual- 
ity Control for Reliability, '' Washington, D. C., January 8, 1958. 
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1. IMPROBABILITY OF EXTREME CASES 


Design engineers have questioned the worst-case criterion on the 
ground that the occurrence of the worst case may be quite improbable, 
and the requirement that the circuit operate under these conditions 
may be too costly for the reliability that it buys. Statisticians may 
recognize the Central Limit Theorem in this phenomenon (Ref. 2). 
Our first object is to formulate a measure of it - some rule of thumb 
to back up the engineer's intuition. 


Suppose we have a circuit whose performance parameter y is a func- 
tion of several circuit parameters, 


y = f (x), Res 4 x,)- 


We assume the design engineer knows the distribution of each compo- 
nent. This information may be in terms of the distribution of the de- 
viation of the component from its nominal value. 


The deviation of the performance parameter is given approximately by 
the total differential formula, 


of of 
dy = 3*) dx) i509 ox, dx_- (1) 
Using the fact that the distribution p; of 


of 


is related to the distribution fj of dx; by 


l f; dy; 

P; (dy;)= of s,s (3) 
Ox; as, 
1 


the problem of estimating the likelihood of some deviation becomes the 
problem of estimating the sum of several random variables, 


dy = dy) +... dy,. (4) 
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This can be done by means of the composition of distributions formula 
of statistics: 


+@ 


p(z) = f(x) g(z - x) dx (5) 


-® 


where z=xt+y, fis the distribution of x, and g is the distribution of 
y- 











nm|— — = = = oe on 


FIGURE 1. Distribution of z = x+y. 


Fig. 1 illustrates the distribution of the sum of two random variables, 
each uniformly distributed from - 1/2 to + 1/2, as found by the above 
formula. The result agrees with intuition: The probability of the sum 
being near -1 or +1 is less than the probability of the sum being near 0. 
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If we now let f be the distribution we have just found for the sum of 
two variables, and reapply this formula, we get the distribution of a 
sum of three variables, and so on to n variables (see Appendix). The 
result of all this is the following: 


If the deviation of an output can be considered as a sum of 
n uniformly distributed and approximately equal random 
variables, then the probability of a deviation falling within 
an extreme nth of the total variation is roughly 1/n!. 


It is clear that this probability decreases as the number of components 
increases. 


2. UNBALANCED RELIABILITIES 


We now consider a second blind spot of the worst-case design philoso- 
phy: If the circuit has more than one output, there is no indication of 
a possible unbalance of the reliabilities with respect to each output. A 
circuit may be designed to operate when any particular combination of 
its elements are at the limits of their initial tolerances or end-of-life 
tolerances, but this does not insure that its mean output will be maxi- 
mally distant from all regions of malfunction. Related to this is the 
fact that the nominal value of an output (obtained by inserting the nomi- 
nal values of the components in the circuit equations) is not the same 
as the mean value of the output, intuition notwithstanding. On the con- 
trary, if y = f(x), ... , x,) and x; are independent, the relation be- 
tween nominal value and mean value is given by a Taylor series whose 
first terms are (Ref. 4). 


Note that if the function is linear, mean and nominal values are the 
same. 


The discrepancy between nominal and mean values is not merely of 
theoretical interest, confined to unusual circuits rarely encountered in 
practice. A close examination of the simple series circuit in Fig. 2 
illustrates this difference. 


Let E = 1 volt and R = 1 ohm, and suppose that R is uniformly distrib- 
uted from 1/2 to 3/2. In this case the nominal value of I is 1, but the 
mean value is ln 3. The probability density function is 1/12, as shown 
in Fig. 3. (See Appendix II) 
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Thus the probability of a failure because of too small a current is 
greater than the probability of failure from excess current. If a de- 
sign engineer knew of this unbalance, he could compensate for it by 
choosing R less than 1. 


3. MEASURE OF RELIABILITY 


A third difficulty with the worst-case design philosophy is that it pro- 
vides no clear measure of reliability. An engineer using this method 
will have the satisfaction of knowing he has considered reliability, but 
he will have no measure of the effectiveness of this consideration. 
This is really the key question of the entire reliability problem. To 
answer it we need the output distribution. This in turn not only gives 
a measure of reliability, but also tells us how the output is distributed 
between its extremes. 


There are several methods in the literature for estimating the distri- 
bution of the output of a circuit. A rather popular one equates the 


mean to the nominal value, and estimates the standard deviation of the 
performance parameter y from 


2 n 


Then, assuming a normal output, its probability density is given by 





p (y) = ! exp - (y - ¥) (8) 


vr © 2 ¢? 
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This has been called the "propagation of error technique, " because of 
the application of Eq. (7) to computation of precision measures. (Ref. 
5, 6, 14.) 


A second method, also using the linearity assumption, is based on the 
composition of distributions formula (5). This has the advantage over 
the propagation of error technique because we are not restricted to 
normal distributions. In the study of the extremes of the distribution 
function, the normal curve would have been quite unsuitable. Its tails, 
trailing off to +060 , would have submerged the very thing we wished 
to observe. 


THE METHOD OF SYNTHETIC SAMPLING 


The method that we have found best for ease of application, adaptabili- 
ty to available component data, and accuracy is Monte Carlo (Ref. 7). 
In general, the essence of Monte Carlo methods is the conversion of a 
problem to some equivalent statistical one and then solving this experi- 
mentally, by observing a large number of cases. A classical example 
is integration (Ref. 8). Our problem is a natural one for Monte Carlo, 
for it is essentially statistical to begin with, and needs no conversion. 
For this reason it has been called synthetic sampling. 


Suppose we wish to obtain the output distribution of 


y= f(x), ere 9 Xn) 


where the x; are distributed in a known way from a minimum of xj, to 
a maximum of xjz. We assume that the cumulative distribution func- 
tion cj of the i th component is such that, corresponding to any ran- 
dom number between 0 and 1, there is a unique value of the parameter. 


The flow chart for the synthetic sampling method is then shown in Fig. 
4. The yj'S are a set of test points for the distribution. When the test- 
ing is over, column 0 will contain all the cases less than yp, column 1 
will contain the number of cases between yg and y;, and so on. Note 
that the y; need not be equally spaced, but can be chosen to highlight 
any particular region of the distribution. 


You may ask, what faith can we have in answers obtained in this way- 
by chance, so to speak? How do we know that we can trust the random 
number generator to be unbiased? The answer is that there are statis- 
tical tests that can be and have been applied to these questions and that 
tell us with what confidence we may believe the results. Studies of 
this sort should be part of a proper application of the synthetic sampl- 
ing method. 


However, if possible, it is always comforting to test the method ona 
problem for which we know the answer. For example, using the com- 
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position of distributiors formula, we have computed the distribution of 
the sum of four variables, each uniformly distributed from 0 to l. 
This gave the theoretical probabilities for z falling into each interval 
of length 1/2 from 0 to 4. The distribution was then found experimen- 
tally. A comparison is shown in Fig. 5. 
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FIGURE 5. Distribution of z = x] + x2 + x3 + Xq-. Com- 
parison of theoretical distribution for 384 cases with syn- 
thetic sampling of 38,400 cases normalized to 384. 


EXAMPLE 


The method of synthetic sampling has been applied to the analysis of 
the switching circuit shown in Fig. 6. This circuit was originally de- 
signed using the worst-case philosophy, in which the transistor par- 
ameters were at initial purchase tolerances. In our testing we let all 
parameter distributions c. be uniform with end-of-life limits. Note 
that any other available component data could have been used as well. 


Two performance parameters were considered: 
1. The voltage difference V, - V,, when the circuit is ON. 
2. V. - V,, when the circuit is OFF. 


For reliable operation these had to be greater than zero with some 
noise allowance. 


Since we were particularly interested in the nature of the distribution 
in the neighborhood of zero - for this is where failures will occur - we 
chose our test points accordingly. The results of 100, 000 tests are 
shown in Fig. 7. Note the unbalance between the ON and OFF cases. 
When ON, no case came near failing. When OFF, 693 cases were be- 
low the allowed noise level. This was a result which could not have 
been foreseen by simply considering worst cases. 


In an effort to equalize the ON and OFF cases, the resistance R3 was 
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decreased from its original design value of 2. 7k. As R3 decreased, the 
ON case became slightly less reliable while the OFF case reliability 
improved. The optimum value appears to be between 2. 3k and 2. 4k, as 
shown in Fig. 8. With this value, and with the parameter distributions 


we have assumed, the probability of failure from parameter drift is 1 in 
10,000. That is, the reliability is 0.9999. 
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AUTOMATION OF DESIGN 


In the last section we were given a circuit design and component distri- 
butions and found the output distributions and reliability. By trying 
various values of R3 we altered the output distributions and improved 
the reliability. We will now outline an automatic design procedure using 
the method of synthetic sampling. 


Our design problem is the selection of the values for Rj, R2, and R3 in 
the switching circuit shown in Figure 6. Besides the two performance 
parameters considered previously for the ON and OFF reliabilities, we 
wish to consider the possibility of exceeding a maximum OFF bias. 
Thus we will have three performance parameters: ON CASE, OFF 
CASE, and MAXIMUM OFF CASE. Corresponding to each case there 
will be an output distribution which may be found by synthetic sampling 
following the flow chart shown in Figure 4. 


Without loss of generality we may assume failures correspond to nega- 
tive values of the outputs. (This can always be arranged by adding a 
suitable constant to the output, or by changing its sign.) Figure 9 then 
shows the three distributions for some starting resistance values. Our 
object is to find the resistance values that minimize the number of fail- 
ures. Clearly, in order to do this we have to shift the worst case dis- 
tribution to the right. 


Our present program has a built-in intuition for selecting the parameter 
change to cause this distribution shift. For example, we know that the 
OFF case output is most effectively shifted right by decreasing R3. 
Hence, if our program recognizes the OFF case as the worst case, it 
will decrease R3 a small amount. This should improve the result and 
we may now test the distributions obtained with the new parameter 
values. Perhaps a different case will now be found worst. The program 
selects the parameter change to improve this, and so on, until the de- 
sired reliability is attained. 


Various refinements suggest themselves. It is unnecessary to use 
Monte Carlo to test every change of the resistance values. Instead we 
may use the shift of the nominal value as a measure of the shift of an 
entire distribution. This saves time, since it replaces computation of 
the large number of samples necessary to find a distribution by compu- 
tation of the single nominal sample. However, since nominal value 
shift is only a rough indication of distribution shift, we must "touch 
base" every so often with Monte Carlotests. These serve as the basis 
for further changes. A flow chart for this procedure is shown in Figure 
10. 


Our program, shown in the flow chart has a further refinement. The 

more reliable the circuit becomes, the more current it will require 
from prior circuitry. Therefore, we want our design to minimize fail- 
ures only within the capability of the circuit. This may entail shifting 
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some distributions left and others right in order that their left endpoints 
line up at some desired position. This position may be such that a pre- 
determined number of failures will occur at end-of-life of the circuit. 


Another refinement is an automatic means of selection of the parameter 
change for improving a particular case. The total differential formula 


oz Oz Oz 
dz = dR, + 22 aR, + dR 
aR} i @R2 2 @ R3 3 





may serve to make this'selection. If we let dRj be a fixed percentage 
of R; the relative magnitudes of (Oz/29R;) dR (i = 1, 2, 3) give the 
effectiveness of Rj in altering z. The sign of the partial derivative 
tells whether an increase or decrease of R; is needed to increase z. 


IN CONCLUSION 


To summarize, the worst-case design philosophy says: "Let the cir- 
cuit operate when the circuit parameters have their extreme adverse 
values."' In view of the shortcomings of this approach already discus- 
sed, we propose that it be replaced with the design philosophy which 
says, "Let the circuit operate with probability R when the parameters 
have their probable values.'' Synthetic sampling seems to be an excel- 
lent way to implement this philosophy. 
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APPENDIX I 
Let us denote the sum of n statistically independent variables (each 
uniformly distributed from -1/2 to + 1/2) by x,, and the distribution of 
x, by f,. The composition of distributions formula, Eq. (5), then gives 
f> (x>) = fy (x) g (xz - x) dx (9) 


- 0 


where f) = g, and x, =x. With the distributions we have chosen, the 
independent variables x and x5 - x are less than 1/2 in absolute value. 
Thus 


min {x2 +1, l j 


2 2 


f, (x,) = dx 
max {*2 » 2 ss } 
2 2 


and considerations of cases gives 


x2 +1 -1 Sx, 40 


£ 2(x2) = 


-x2 + 1 0x, +1 


We are interested in the nature of f, in the neighborhood of its nega- 
tive extremity, -n/2. Clearly, from Eq. (9), 


A 


f, (x) = dx -1 x, 20 


“in 
2 
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Suppose 














x, + £ > re x eS x5 + ns 
n-1" > *-2" 3 Sina. me 
f{ -]} (x, 1) = { owe dx dx,. dx ai 
2-n 3-n -1 te 1 
2 2 2 
P l-n < < 3-n 
or 2 = *x,-1 = 2 
Then, by Eq. (5). 
© 
f(*,) = = (x) g (x,, -x) dx 
- 0 
where f,,_, and g are zero outside the ranges 
6.23 1 
ye 
and 
len & < 3-n 
x= 
2 2 
Hence 
: l 3 -n 
+_— , 
min {, o 7 } 
‘. (x,,) = fn-1 (x) dx 
max l-n l 
_ x = «ane 
< < 
For -—=x = - + + 1, this gives 
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i l l l 
x, + _ Xn-1 + wo x, + > x2 + 5) 
f,, (x) = J | ae dxdx,...dx, 
a noe of 1 
2 
ao «a. Ga. BS Acd. 
2 n 2 


Differentiation of this gives 


/ 
ad (x) = ts (x 


l 
n-1 * =) 


and for the (n-2) th derivative we have 


(n-2) (n-3) (1) 


f (x) =f, (k++) See =f, (xt B5d pas, (xt B52) 


(n-2) 


Since f, (-1) = 0, it follows that f, (- =) = 0, similarly for the 


lower order derivatives at - + - Also since 


(n-1) 


‘ 
f (x) = £, (e+ BL) = at x=- > 


we may infer that the order of contact of £,,(x) at the point - >: with 


the x-axis, is n-2. Furthermore, since f, is a polynomial of degree 
n-l, the nth order and higher derivatives are all zero. All this says 


n-l 
(x + 3) < 


f£ (x)= a for - > ; 
” (n-1)! 2 
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and the probability of falling in this region is 


é2-n 
2 


(n - 1)! 


volts 


which is the result we wished to prove. 


Alternative methods of arriving at the same result are given by 
Silberstein (Ref. 13) and Parker (Ref. 12). The article by Berz (Ref. 
11) is also of interest in this connection. 








APPENDIX I 


Let the cumulative distribution of R be P(R). Since R is assumed uni- 
formly distributed from 1/2 to 3/2 we have 


R 


or 


- 2 


= sin —_ .% 
P(R) = LéroR-—, > 
i 
2 


The probability that the current is less than or equal to I is the same as 
the probability that R is greater than T’ because of the inverse rela- 


tion between these variables. Hence, the cumulative distribution of I is 


= “ihe si - eo mh - Red. - 
P(I) = P {a > ~} 1 - P(R) R= ; 


The probability density of I is 


dP ; l 
dl T 
The mean of I is 
2 
rs 1 +t dI = In 3 
= 4 
- 
3 
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CONSUMER RESEARCH AND QUALITY CONTROL 


Robert Weller 
Alfred Politz Research, Inc. 


CONSUMER RESEARCH OBJECTIVES 





The objective of consumer research should be to enable the marketer 
to build his sales economically at the rate at which he buys consumer 
research. To accomplish this goal, consumer research must have certain 
characteristics. 


1. It must cover all aspects of marketing -- product, distribu- 
tion, merchandising, advertising, pricing, etc. Advertising researc! 
cannot be a separate entity and separate from consumer product research, 
and still be productive. All phases of marketing are too interrelated. 


2. It must be creative -- not just informative and entertaining. 
Productive research today even gives direction to management regarding 
the kinds of products to produce in advance of production. Conducting 
this pre-product research does not mean discovering what consumcrs 
want, but rather what consumers will accert. In analyzinz the 
marketing, it doesn't take much rescarch to find out that the obliga- 
tion of the marketer is not to give th: consumer merely what the latter 
wants. The consumer wants cverything at the lowest possible price, 
and, at the moment, the consumer hss no iraginetion, not cven for his 
own benefit. The conswner depends upon the creativeness, inventive- 
ness, and pioneering of industry, and the markcter has to come in and 
make the consuncr want what has been invented and what thc marketer 
desires he should want. That's what is called selling, and that's 
the nutshell of all marketinz operations. 





} 


- 


uppose we had conducted typical "find-out-what-he-wants" 
surveys in 1800 and asked consumers to tell us what to do to improve 
their products, conveniences, etc. If such opinions had been 
gathered and followed, after these meetings all of us would ride back 
to our various offices in the finest horse-bugzys you ever saw, 
reading, under the most efficient kerosene lamps, newspapers that 
contain news five days old. 


“4 
oO 


> 


This does not mean that we do not study consumer's wants. We 
study them to help us learn how to create new wants and desires. 


3. It must be predictive -- To be creative, consumer research 
must be able to predict. Prediction involves risks, including the 
risk that the possible inadequacy of the research will eventually be 
detected. Some think this risk can be avoided if research just con- 
fines itself strictly to descriptive research. This concept, however, 
is both selfish and illusionary; it does not change the principal 
issue. It only suggests a division of labor in which the professional 
researcher is careful not to participate in drawing productive con- 
clusions from the statistical data he collected. The conclusions, and 
therefore, the predictions, are then left for somebody else to make -- 
the sales executive, the manufacturer, the company research director, 
or management in general. The point still remains that consumer 
research is not useful unless it leads to predictions. 
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Guiding manazement means providing a basis for making decisions. 
A decision entails the expectation of a specific result; therefore a 
decision entails a prediction. Worthwhile research arrives at the 
causal conclusion that action A will lead to result. B. A decision can 
then be made to take action A, with the knowledge that result B will 
occur. Here is where valid research can increase tremendously 
management's knowledge level and lower management's risk. Of course, 
whether action A ought to be taken or whether result B is worth the 
extra effort is not the problem of the researcher, but of management. 
This is so because management not only must consider the consumer 
angle, but also company policy and philosophy. Often these are of a 
higher order than simply the consumer aspect. 


4. It must use experimental design -- To be predictive, consumer 
research must be conceived and executed as an experiment. The survey 
is but our laboratory in which we put to test our ideas of our products, 
our advertising and our merchandising. It's where we test reactions -- 
reactions of consumers in an effort to learn more about their attitudes, 
their motives and their behavior. Being conceived of as an experiment, 
consumer research then must start with an hypothesis, it must use or 
create a mechanism for testing the hypothesis and then must draw a 
conclusion about the acceptance or rejection of the hypothesis. The 
research task then is an exercise in experimental design, and the 
design of experiments requires great creativity and imagination. The 
consumer researcher cannot hope to have statistics or psychology or 
some other discipline do this creative work for him. Engineers make 
use of various sciences, but it is the existence of specific design 
problems which makes engineering a science of it's own. Consumer 
research, too, and predictive research in particular, has it's own 
unique design problems which should make it a science of it's own. 


THE PRODUCT OF CONSUMER RESEARCH 





Having these characteristics, what then is the "product" 
consumer research offers to management? The product is basically 
"information." Information in the form of recommendations of suggested 
management action, and in the form of conclusions based upon the data 
collected. The data collected are, in a sense, created by the re- 
searcher. They are created because they are reactions to stimuli we 
put to the consumer. The researcher creates the questionnaire. But 
what validity can we attach to these data? In short, how do we 
control their quality? 


THE FACTORY THAT PRODUCES CONSUMER RESEARCH DATA 





The factory that produces the data for consumer research is, 
of course, the survey itself. There are many "processes" and "machines" 
that help produce a survey. 


First, is the design of the sample and the actual carrying 
out of this design in the field. The utilization of probability 
sampling has given consumer research the foundation and framework within 
which it can operate close to the level of the pure sciences. Without 
such a framework, validity could not be insured, and reliability 
statements could never be made. Alfred Politz introduced probability 
sampling to the commercial field and has been a major influence for it's 
growth among consumer research organizations. 
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A second part in the survey production is the questionnaire 
itself, which is perhaps the most important piece of machinery in our 
factory. Third is the field -- interviewing. Fourth is editing the 
questionnaires and coding the questionnaires for translation to 
machine tabulation; fifth the mechanical translation of the information 
to punch cards or tapes and sixth the machine tabulating. Seventh is 
the computions and eighth is the analysis which includes recommenda- 
tions for management action. Each of these parts can be adapted to 
quality control techniques. 


The relative importance of each of these parts and how they 
are balanced in reference to their cost and their contribution to the 
overall quality of a survey, is discussed in the Appendix of a study 
we conducted for Life Magazine, called "A Study of Four Media." It 
is contained in the chapter called "Balanced Accuracy." 


Today I shall limit my remarks to just one of the eight steps 
above, "Field Interviewing." In this discussion, we shall assume that 
we have a properly executed probability sample and that our questionnaire 
is a properly functioning piece of machinery. The questionnaire is put 
into this working order by adequate pre-testing before the survey is 
launched. 


The importance of field work has been discussed and examined 
many times in literature. There have been several investigations that 
have actually quantified the contribution that interviewer variation 
makes to the total mean square error in a survey. One was a paper 
given by Steve Stock, called "The Analysis of Interviewer Variance In 
Block Samples." This study was conducted in a single city, with blocks 
used as primary sampling units. Of the total variance, the major 
portion was contributed by interviewer variation, and equalled approx- 
imately 58%. 


ADMINISTRATIVE CONSIDERATIONS 





The most important contribution to the assurance that quality 
work will come from the interviewing staff is to have top-notch 
personnel. This, of course, is a very obvious statement, but it never- 
theless needs to be said over and over again. The development and 
build-up of a field interviewing staff does not mean to accumulate a 
list of names. It does mean to establish a close relationship with 
interviewers, treating them as employees of the firm and gaining their 
loyalty. This everyone takes for granted, but it is not at all easy 
to develop such a relationship. It can be developed only if the re- 
search company can provide interviewers with a sufficient volume of 
work. This does not mean current volume as much as it means a promise 
of work in the future. Work guaranty and security are important to 
interviewers; more important than high-paid spotty work. In our 
company, in order to further solidify this relationship, we have put a 
good proportion of our interviewers on a minimum wage guarrantee basis, 
regardless of whether they work or not. In return they agree to work 
exclusively for us. 


To build a field organization in this manner, it is immediately 
obvious that the total number of interviewers a company can have is 
directly related to the volume of business it has. Our company 
averages about 6,000 interviews a week, or one-third of a million a year. 
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Even with this tremendous volume of interviews, our regular staff con- 
sists of only 600 interviewers. We do have a few hundred more whom we 
can pretty well depend upon, which we turn to during peak weeks when 
we're at the 3 sigma level instead of the average. 


So long as the total number of interviewers is directly re- 
lated to the dollar volume of a company, so must be the total number 
of primary sampling units a company uses. Six years ago we were using 
55 primary sampling units; three years ago 103; today we are moving up 
to 152. This expansion in primary units corresponds to our dollar 
volume expansion. Had we, five or six years ago, tried to use 152 
primary sampling units, we would have been using names and not quality 
interviewers in these areas. Ina sense, this is a case of the "rich 
getting richer." But it still is axiomatic. Only large organizations 
can use a large number of primary sampling units with a larze inter- 
viewing staff if high-quality field work is expected. 


INTERVIEWER TRAINING AND SUPERVISORS 





A common fallacy about consumer surveys is that interviewers 
have to be personally trained for each survey. With a solid field 
organization, such a procedure is highly inefficient and wasteful. 
Interviewers are no different from other field personnel, such as 
salesmen or missionary men. Preliminary training is needed, but con- 
tinual training is not. For very special and unusual surveys, training 
may be necessary. An example of this is our recently completed "Life's 
Study of Consumer Expenditures." This was such a gigantic study, plus 
the fact that new field and questioning techniques were used, that 
training was a must. This kind of training, however, is the exception 
and not the rule. Moreover, with a loyal field staff that has been 
with an organization for some time, interviewers will soon accumulate 
more experience than we in the home office, and consequently, will 
know more about interviewing than we do. This training fallacy grew 
because research organizations had names of interviewers, not a quality 
field staff. The remedy was, therefore, put forth that interviewers 
must be personally trained for each survey -- a needless and wasteful 
effort. 


While interviewers need not be repeatedly trained, occasional 
personal supervision is necessary. We call our supervisors, "Regional 
Instructors" because their principal functions concern recruitment of 
interviewers and trouble shooting. Their supervisory duties are 
limited. Each instructor is assigned one or more primary sampling units. 
Altogether, we have 69 of them. All are on a minimum wage guaranty; 
all have been seen by members of our Field Department, and almost all 
have been to our New York offices. 


AREAS FOR MEASURING INTERVIEWER PERFORMANCE 





In spite of having a top-notch field organization which we 
know from years of experience performs excellently, it is still nec- 
essary to check the quality of every survey produced being sure each 
meets performance standards. 


In a typical consumer survey we must measure interviewer per- 
formance in the following areas: 
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Do they actually ask the questions? (The cheater problem.) 

Do they omit questions? 

. How efficiently do they complete their assignments? (cost per 
interview) 

How accurately do they make observations? 

- Do they misunderstand questions and thereby mix questions up? 
Do they distort questions? 


. 


ow - Wh kr 


Depending upon the performance areas we're checking and the 
control program established, different units of measurement and sample 
sizes are used. A unit might be a single question, a group of ques- 
tions, an entire questionnaire or the entire work of an interviewer. 


PROCEDURES FOR MEASURING PERFORMANCE 





J 


The first performance area, "The Cheater Problem," can be 
handled in three ways: 

1. Post card checks 

2. Telephone checks 


3. Personal re-interviews 


It is rarely necessary to re-visit respondents in person in 
order to verify whether or not they have been interviewed. Post card 
or telephone checks are all that is needed. In making a post card 
check, in addition to determining whether or not an interviewer called, 
a few other questions selected at random from the questionnaire are 
asked asain. This provides us with several points to check for con- 
sistency. 


It has been our experience that 15-20 interviews per inter- 
viewer is sufficient to ascertain whether or not the interviewer has 
conducted his interviews. In selecting a sample, we select a few 
interviews at random from each of his assigned locations. On the 
average, a location consists of about five or six interviews. Ina 
typical post card check, then, we will select three interviews from 
five to seven of the locations for each interviewer. We do not sel- 
ect more interviews per location because if an interviewer has cheated, 
it usually means he has not visited the location at all and a sinsle 
interview would be sufficient. We do not select less than three per 
location, however, because we do not get 100% response to our cards, 
and we want to get at least one return per location. The response 
rate averages about 40% to 45% on post card checks. Actually, it is 
much more likely for a person who has not been interviewed to return 
the card than for a person who has been interviewed. We have set up 
the following acceptance and rejection procedure: 


If any post card is returned because of "no such address" or 
"no such party at address," a 50% check on the remaining interviews of 
the interviewer is sent out. If we receive two such returns, a 100% 
check is made on his remaining interviews. If any card is received 
with the note that he was not interviewed, a 100% check is made on the 
remaining interviews. If there is an inconsistancy concerning a 
particular question re-asked, a 50% check is made on the interviewer's 
remaining work. If two inconsistancies show up, a 100% post card 
check is made. From the second mailings, if any discrepancies at all 
are ascertained among the returns, the entire work is then held aside 
for a further check by a Regional Instructor. All invalid work is 
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discarded. 


Telephone checks are used if a study is either a local one or 
one in which there is an extremely short field schedule, with interview- 
ing having to be completed within one or two weeks. The procedures and 
criteria used are identical with the post card check. 


In reality, complete cheating is a very rare occurrence. A 
more common problem is the partial or hurried interview, whereby the in- 
terviewer will either skip many questions, asking only the key ones, or 
so hurry the interview that poor quality results. This is more 
difficult to detect than cheating, and it is here where quality control 
techniques are most valuable. 


The second performance area, omitting questions, is a simple 
one for establishing standards and for quickly ascertaining whether or 
not an interviewer is under control. Here the unit of measurement is 
an entire location. Out of the total work in a location, the per- 
centage of questions omitted is determined. Our standard is 5%5 with 
appropriate limits depending upon the total number of questions. The 
first three locations completed by the interviewer are used. If an 
interviewer is found to be out of controlly omitting too many questions, 
he is immediately contacted and told of his errors and his entire work 
on that survey is then checked for omissions. An interviewer is said 
to be under control on omissions if his first three locations are in 
control. His remaining work on that survey is then not further checked 
for omissions. 


Cost per interview, performance area 3, is of great concern in 
any survey. We usually have past studies of sufficient similarity so 
that we can use standards already developed. In any single survey, 
however, we use two standards; one for metropolitan areas and one for 
non-metropolitan areas. While costs for reaching respondents is higher 
in the non-metropolitan areas, actual interviewing time is higher in 
metropolitan areas. The first four locations are selected per inter- 
viewer and the average cost per interview per location is computed. 


Interviewers who fall outside of the limits are inmediately 
examined. In this cost area, however, more often than not, those 
falling outside the limits can usually be explained by unusual circum- 
stances. In probability sampling, anything is liable to happen 
simply because interviewers often find themselves in "unusual" places. 
They have to detour because a bridge has been washed away, or the mud 
may be so heavy as to cause them to walk long distances or hire a tow 
truck to pull thein out of the mire. Unexpected interruptions during 
the interview, poor weather, etc., all can cause costs to skyrocket. 
For those interviewers for which no assignable cause can be discovered, 
two actions are taken. First, if his costs are between two and three 
signas, a letter is written to him, giving the average costs for work 
similar to his and suggesting how he might improve his performance, and 
the rest of his work for that survey is examined. If his costs are 
beyond the three sigma limit, he is asked for an explanation and a de- 
tailed check of his past performances is made, and his remaining work 
checked, If these checks, and/or his explanation are not satisfactory, 
the interviewer is dropped. In such cases, he is usually inefficient 
or is simply making excessive charges. If an interviewer's first four 
locations are under control, remaining checks are not made. 
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For these first three performance areas, standards from past 
surveys can usually be used. For the last three, concerned with 
accuracy of observations, understanding of questions and the distor- 
tion of questions, past surveys are of little help, with the exception 
of the case where we have repeat surveys. In all tailor-made or "first- 
shot" surveys, we need to develop standards from the current survey 
itself. This seriously limits the application of quality control 
techniques. 


The most obvious way to establish a criterion for evaluating 
the quality on such custom surveys is to have more experienced personnel 
"re-do" the interview, or parts of the interview. It is usually done 
by a Regional Instructor and assumes, of course, that he zets the 
"correct" answer. This call-back procedure is very expensive. In 
addition to it's expense, the great disadvantage is that it is time- 
consuming. Unless there is ample time for field work, the call-back 
method cannot be used. Moreover, certain questions do not lend them- 
selves to such treatment. You cannot ask them again and hope to get 
the same results. Knowledge questions, attitude and opinion questions, 
usually have to be avoided in a call-back procedure. Also, unstructured 
questions (or so-called "depth" questions) fall in this group, for 
with such questions considerable lattitude is permitted the interviewer. 
To try to duplicate the stimuli is virtually impossible. 


The main advantage with the recall method is it's simplicity, 
for acceptance and rejection criteria can be readily established. Sol 
Dutka, President of Audits & Surveys, has used this method with ex- 
cellent results. He does not put all interviewers under such control 
in each survey, but rather rotates them, controlling only a portion on 
each study. This method is described in The American Marketing Pro- 
ceedings of the Detroit Meetings, 1957. 


If past standards and re-interviews are not feasible, then 
each interviewer's work must be compared to the work of all other 
interviewers. There are two major problems with this procedure. The 
first problem is that differences between interviewers may not be 
because of performance, but because of real differences between one 
interviewer's group of respondents compared to other interviewer's 
respondents. For example, if an interviewer is assigned only rural 
locations, or only high-income locations, etc., we can expect this 
response to be different from others. 


The remedy would be to randomize locations among interviewers. 
This, however, leads to excessive travel costs. From a pure cost 
point of view, it is, of course, far more efficient to assign loca- 
tions as close to an interviewer's home as possible. But, this inter- 
feres with out quality control. A compromise is usually effected. In 
the assignment of locations by the Field Department, both accessibility 
and variety of areas are balanced subjectively. In this way, inter- 
viewers are assigned a variety of areas without being too expensive to 
get to. 


The second major problem facing this technique is the inade- 
quate sample size for each interviewer. We are now not operating with 
an "agree or disagree" situation as we are with the repeat interview 
technique. We are now concerned with more or less random variation. 
Our standard is something like: "20% of the public gives as it's 
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reason for no longer using product Y, poor absorption." The total 
work load an interviewer will have in a survey averages between 40-60 
interviews. Even if we used his entire assignment, we would be using 
very wide tolerances, too wide to be sensitive enough to be used for 
improving quality. 


To overcome this difficulty, we are experimenting with what 
we call, "Pattern Sheets." This is a principle which simply uses the 
relationship between questions as a criterion, thereby increasing the 
sensitivity and discriminating power of the test. These sheets are at 
present a visual device, and we have not as yet quantified their 
results. Several questions are placed across the top of the sheet and 
several placed down the side. A mass of results from questionnaires 
are plotted to establish the general relationships. The sheet will 
now look much like a scatter diagram. Several copies of these sheets 
will then be made up. On each copy, several interviewer's answers will 
now be plotted, using different colors and symbols for each interviewer. 
When an interviewer is out of control, a distinct pattern takes shape 
that is readily visible by following the same colors and symbols. These 
Pattern Sheets are very effective in detecting interviewers who do a 
partial interview and then fill out the rest of the questionnaire at 
home. This system works because interviewers, like everyone else, do 
not act as the laws of chance, nor have they the marketing superiority 
to predict what relationships will show up. Consequently, patterns 
show up very quickly. 


The following chart is an illustration of a portion of a 
Pattern Sheet. Down the left we have listed a question pertaining to 
frequency of use of one of our client's products, brand Y. Across the 
top we have two other questions; one pertaining to attitude toward 
price of brand Y, the other, frequency of use of brand X. There is no 
overall relationship between price of Y and attitude toward Y. 
However, notice that Interviewer A shows up with a strong relationship 
while Interviewer B was in control. The other part of the illustration 
shows a good relationship between usage of brand Y and it's largest 
competitor, X. Here the same interviewer (A) wrongly guessed that no 
relationship would exist. Interviewer B was again in control. 


We believe these sheets show great promise as they are not 
nearly as dependent on sample size nor on complete randomization of 
location, for relationships like this are much less dependent upon 
location characteristics than they are on behavioral characteristics, 
which are much more randomly distributed. 
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THE ECONOMICS OF QUALITY CONTROL 


Edward W, Armstrong 
Simonds Saw & Steel Company 


The basic procedures of Quality Control can be used in an existing 
manufacturing operation to improve the level of quality, increase pro- 
duction and at the same time reduce the cost of manufacture of many pro- 
ducts, This paper is presented to illustrate how a typical application 
is made and what economic rewards can sometimes be realized. 


Let us compose a synthetic situation and say that we are in the 
business of producing irregular shaped cutters, that our cost is satis- 
factory, the quality of our outgoing product is good and our customers 
are happy. Our management, however, would like to see a lower ratio of 
indirect to direct labor. The conditions are as follows: 


Production - 100,000 cutters per week 
Factory Cost - Approximately $.36 each 


Thus we are allowed: 


100,000 x $.36 x 50 weeks = $1,800,000 
per veer to purchase raw stock, direct 
labor and ell the various burden items 
connected with producing our product. 


Accordingly we decide to experiment with Quality Control, knowing 
that most of the characteristics of our irreguler cutters are visual and 
that our job will not be easy. Our personnel are all responsible for 
quality as well as quantity. Our inspection consists of a lerge group 
of final inspectors and we do not have roving inspectors, However, we 
assign one man as a Q. C. inspector in our roughing area. The first 
operation on cutters is rough forming. Our inspector, working with a 
relatively small sample from e: +h lot and reporting what he finds on a 
form as shown in Figure 1, tells us that each operator is producing some 
defective pieces and that a few of the men contribute much more than 
their fair share of rejects. We soon find that we need a sample board to 
guide our inspector in his decisions so, with the aid of our supervisors, 
one is developed. 


Our inspector classes the type of defect which he observes as; 
Wrong location, A; Dull tool, B and Twist, C. We soon find that either 
A or B are operetor controlled and we discuss them with supervisors and 
operators to encourege improvement in their work. Type C occurs only 
about 2% of the time in some lots but we can't seem to stop it. Finelly 
we decide that it occurs mostly on one machine and is not limited to eny 
particular size of cutter or operating shift. We ask "WHY?" One opera- 
tor suggests that he knows his forming machine does not always stop in 
the seme plece and possibly that is the cause of our trouble. The mester 
mechenic egrees that it is possible so the mechine is repaired and C de- 
fect is reduced. Our supervision could not previously control the prob- 
lem because it occurred so infrequently. Rather, they were accustomed 
to living with it. 
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FIGURE 1 


Working with the information taken from the Q.C. inspectors reports, 
we make fraction defective charts (p charts) of each operator for a few 
days, without showing them to the men, A sample chart of our fastest 
men, who has a history of producing at 150% of a normal rate, is shown in 
Figure 2, It is apparent from the first part of the chart that he also 
produces far too many defects and initially his 5 is 10.2%. Then we ex- 
plain the charts individually to each man, We state that, if the machine 
does not run right, if the tools are not right, if the steel is not right, 
we will fix it. We tell them our belief that the work can be improved. 
We ask them to tell us if they can't do it right. We hang the charts on 
the machine, 


Immediately we encounter from the operators a rash of reasons for 
defective work, We proceed to eliminate the reasons by repairing equip- 
ment, revising tooling, assisting operators to perfect their technique 
and doing whatever else is necessary, Within a few days, our efforts; 
combined with the psychological effect of the charts, has encouraged each 
man to produce better work, Our offending (best) operator hes made a 
great change in the quelity of his work, His 5 is now 1.2%. He hes 
discovered that, because all equipment, tools end raw stock are controll- 
ed better, he can still produce at his usual very fast rate and not make 
many defects. 


After another three weeks, our "best" operator is still puzzling to 
us, Many of the other men are doing better work than he does, yet we 
know him to be highly skilled. We discuss the reasons with him. Suddenly 
he says "When I form the last piece, it is sometimes too short." I can't 
locete it right. Yet, if I bury it in the good pieces, I get paid for it. 


I won't do that again." His 5 is .3% from then on. 
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A "p" chart for the entire rough form department is made as shown in 
Figure 3. It indicates that, within a few months, we have saved at least 
2% of the total production as good cutters rather than defectives. We 
previously found the defectives at final inspection. A rough calculation 
of the dollars saved in the forming area is as follows: 


SAVING IN FORMING 





100,000 Cutters per week x 2% x $.36 each = $720.90 per week 


$720 x 50 weeks = $36,000.00 per year 





In the meantime we have had policy decisions to make, We have been 
asked, "Who screens these defective lots?" And we have answered, "You 
made them, you find them." We have created quality consciousness among 
all the people who make rough cutters, 


All of the other operstions in the manufacture of cutters naturally 
come under our consideration. We install scrap cans at each operation 
and ask everyone who finds defective pieces to put them in the can, 
Periodically we count the amount in the cans and determine where our re- 
jects are occurring. We are like the F.B.I. after it's 10 most wanted 
men because we always have some operations that are much worse than 
others. We emphasize the fact that a continuous chain conveyer is con- 
tinually causing many defects although it only comes to our attention 
when the conveyer jams and we must shut down for s while. We convince 
management that it is to their advantage to replace the conveyer. Soon 
a different type conveyer is installed and our periodic problem has dis- 
appeared, 


The periodic count of rejects from each operation also emphasizes 
that Final Inspection can be a source of great information to us. We 
decide to use thet source to our best advantage. We have several in- 
spectors doing what is celled 100% inspection, We provide them with a 
form so they can report how many cutters they look at each day and what 
they find wrong. Immediately, we see the need of treining each one to 
work to a definite level of quality. A sample board, showing the vari- 
ous possible defects with Go and Not Go examples, is therefore developed, 
The record from the inspectors is accumleted for a few weeks and a 
fraction defective chart as shown in Figure |; is made, We find that the 
level of quality of cutters incoming for inspection is 9.6% defective. 

A measure of outgoing quality is also made so that we will have a refer- 
ence later when we need it, 


A "Gold In The Mine" survey, made to justify our request for a full 
time Q. C, investigator, shows that 80% of the defects are due to one 
operation called "Final Sizing." It is obvious that we mst work on 
this operation. We ask one inspector to report only the defects due to 
final sizing and to classify these according to the reason for their 
occurrence, We find that the reason most frequently assigned is "Under 
Adjustment." It is controlled by the operators and shows a tendency to 
leave extra stock for Re-Sizing if necessary. A fraction defective 
chart for Final Sizing is made as shown in Figure 5. We note that it 
closely follows Final Inspection, as it should. 
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All of the defects due to Final Sizing are returned there for rework. 
A liberal time standerd is allowed for the rework because it mst be done 
in small lots. Half of the cutters are eventually salvaged and half 
scrapped. We prepare Fraction Defective charts of each operator on Final 
Sizing and hang them on the machines. We note some improvement but not 
enough. We conclude that our operators seem to like the rework. Experi- 
mentelly, we determine that it is possible to operate e Final Sizing 
machine and produce nearly all good pieces if the incoming work is satis- 
factory, The operator must, however, keep the setup adjusted to the 
meen specification, We make a small increase in allowed time to cover 
the small necessary amount of rework and ask that it be done immediately 
rather than sending it to final inspection. Our fraction defective charts 
soon drop to a level which is considered satisfactory. An estimate of 
the saving made is as follows: 





SAVING IN REWORK 





About 8% of the Cutters were reworked. 
Rework Direct Labor amounts to $78.00 per week 


Re-ijnspection Labor (et ea slow rate because 
individual geging is required) emounts to 
$120.00 per week. 


About 3-1/2% of the Cutters were ultimately screpped 
after rework. 


100,000 x 3-1/2% x $.36 = $1260.00 per week 


Thus a total seving from the elimination of rework 
amounts to $1158.00 per week. 


$158 x 50 weeks = $72,900 per year 


Naturally, when the incoming work is not satisfectory, we must in- 
vestigate and try to improve it, One of our investigetions leads back 
to the previous operetion of Rough Sizing. Try as we will, this roughing 
operation continues to give trouble. Since we cannot control it, we are 
led beck to the preliminary operations to try and obtain more control 
there. Finally we decide that the best way to control Rough Sizing is 
to perform a previous operetion differently end eliminate the Rough Size 
completely. When new tools are provided, we find that it is possitle so 
the change is made, A battery of machines are discarded and their opera- 
tors are assigned to a better job. We could not eliminate the operetion 
before because we did not have control of the preliminary steps. An es- 
timate of saving made is as follows: 


SAVING BY ELIMINATING UNNECESSARY OPERATION 





The Direct Lebor required for Rough Sizing 
amounts to $670.00 per week. 


$670 x 50 weeks = $33,500 per year 












As we continue to watch the level of quality incoming for Final 
Inspection, we observe that most of the lots are very clean, It is easy 
to prove that our outgoing quality level with a good plan will be better 
than it was when we checked after 100% inspection befcre Q.C. wes in- 
stalled. A sampling plan is started so that we can find these good lots 
quickly and not waste time on them, Inspection thus becomes a job of 
determining the level of quality. It can be done much faster, More than 
half of ovr inspectors are reassigned to production work, A saving as 
follows is made: 


SAVING BY SAMPLING INSPECTION 





The inspection labor (Indirect) saved amounts 
tc #57).00 per week. 


$57 x 50 weeks = $28,700 per year 


SUMMARY OF SAVINGS 





Saving in Forming Area......... $36,000 per yeer 
Saving in Rework.............. £72, 900 " 


Saving by Elimination of 
Unnecessary Operation.......$33,500 . 


Saving by Sampling Inspection. .$28, 700 " 


Totel Saving.....£171,000 Per Year 


Conclusion: 


We are allowed $1,800,000 to buy material, labor 
and burden, We have seved prectically 10% of 
that amount. 


In addition, ovr ratio of indirect to direct labor 
is satisfactory. 


Our product is better. We meke it that wey. Our 
customers confirm it. 


Cur product is less expensive. We made a svbstential 
Saving in menufecturing ccst. 





Fortune Magezine called Quality Control "the sharpest 
management tool to come down the pike in 50 years." 
Sometimes it seems that way! 
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WHAT DOES THE AIR FORCE EXPECT CF CONTRACTORS 


Colonel J. G. Schneider 
Chief, Quality Control 
Hq Air Materiel Command 


Introduction 


Gentlemen - It is a pleasure to discuss the subject, "What Does 
the Air Force Expect of Contractors," because it provides an oppor- 
tunity for me to follow-up on two previous papers presented to the 
American Society for Quality Control. 


In 1951 at Cleveland, I introduced to the Society members their 
first look at the Air Force's new quality assurance program explained 
by a new Specification MII-Q-5923 (USAF). This document was our formal 
answer to the question, "What does the Air Force expect of contractors?" 
I might add that the reception afforded my discussion of this new cm- 
cept, as those of you who were there might remember, was like most 
things new, not too enthusiastically received, 


In Philadelphia in 1953, "The Fhilosophy Underlying the Specifi- 
cation MIL-Q-5923" was the subject of my talk, I might add that after 
two years trial, the acceptance was far more favorable than the original 
introduction, and now upon returning to Quality Control, after four 
years, the many complimentary things I hear about 5923 from contractors’ 
representatives are most gratifying. In our procurement activities 
overseas we tell the contractor what we expect of him through the me- 
dium of 5923. It may interest you to know that within the few years 
since it was originated, this specification, or facsimilies thereof, 
have received wide acceptance and adoption by our military counter- 
parts and industries in many friendly and allied nations throughout 
the world. It is presently written in the languages of most of the 
European NATO countries, as well as Great Britain, Canada and Japan, 
where it is furthering the use of quality control as a management tool. 


Top Management and Quality 


When speaking of "contractor" I am not only directing my remarks 
to the quality managers within industry but to top-level management 
as a whole, I believe we can accept as a basic principle that the 
quality of any product is largely dependent upon the policy of manage- 
ment, and theeffort, interest and emphasis which all of management 
devotes to the quality and reliability of its products. 


By "management" I mean those members of the organization who have 
responsibility and authority to plan, program, direct and control the 
industrial effort. 


The existence of a quality control organisation can be nothing 
more than a paperwork exercise and so much "window dressing" unless 
its efforts are firmly backed and supported by all other elements of 
the organization, and is not being subordinated to any interests which 
could negate quality. 








Effective quality control within the industrial organization be- 
comes a means to an end which can be summed up in terms of a quality 
product, shipped on schedule, to customer satisfaction and, last but 
not least, at a reasonable profit. If we accept these principles, 
then we must look next to the quality control manager and his obliga- 
tions which can be summed up in a few brief words, "Assure that the 
quality and reliability are in accord with established standards in 
the most economical way practicable with a minimum effect on the in- 
dustrial process," This is an over-simplified statement of a job in- 
volving a great mltitude of diverse activities which must be plamed, 
coordinated and intermeshed with practically all functions and opera- 
tions of the whole industrial enterprise. 


Quality Control and Reliability 





With the spectacular development in technology and the complexity 
of present day weapons systems, and with particular reference to mis- 
Siles, which can be "performance tested" only on a sample basis, some 
rather unique developments have taken place. The emphasis on relia- 
bility in the midst of the complexities of unmanned missiles, and 
space vehicles emphasises the need for professional quality control 
engineers, new, better, and less costly inspection techniques, and 
statistical methods which far outstrip yest y's activities. We 
learned in attempting to define reliability‘l)and then to measure it 
that there is a need for closer relationship between all industrial 
activities than ever before. Quality and reliability take on new and 
added significance in the missile era. 


Today, quality control and reliability planning must be coinci- 
dent with the initial proposal stage and simultaneously follow thru 
the development of models and test articles, the prototype and finally 
the production article, Modern techniques mst be incorporated into 
the plan and properly integrated with the overall engineering, buying, 
production planning, manufacturing and testing activities, All func- 
tions and procedures for controlling quality and reliability mst be 
reduced to written instructions and provisions made for documentation 
of the quality control effort. The system must be designed to prevent 
occurrence or recurrence of defects and a means for positive timely 
corrective action. 


The Contractor's Scope of Responsibility 





The contractor and his quality manager can no longer regard 
"quality control" as an "in plant" operation. His scope of respon- 
sibility and concern extends itself in one direction, into the sources 
and suppliers of a great variety of critical components and parts, and 
in the other direction, into the vast complex of the military esta- 
blishment, the depots, the test centers, the operating commands, and 
even to bases scattered world-wide. 


To further compound the situation, there are many instances where 
the design contractor will develop, manufacture, test, support, main- 
tain, assist in the training of military personnel, and construct and 
operate special facilities for a particular weapons system. 








Within all these areas of activity, quality control plays a vital 
part as a key element of top management. In fact, it is management in 
the broadest and most exacting sense, involving a high level of pro- 
fessional, scientific, technical and managerial skills. Planning the 
quality control program is as important and precise as all other plan- 
ning for a given project. 


Of significant import is the need for data which gives a concise 
measure of quality control effectiveness and the quality of performance 
and reliability during production, test and after delivery to the cus- 
tomer. Quality Control should be so positioned within the organization 
to assure unbiased analysis of data concerning quality and reliability 
and to redirect action information to other responsible elements of the 
organization in a fast, positive and complete manner. To be really 
effective, there mst exist a closely integrated relationship amg 
ana between all participants in the industrial program. There is an 
ever increasing need for verifiable evidence of positive control of 
quality of each item, component and subsystem as it is passed from 
vendor to subcontractor to prime contractor. 


Vendor Rating System 


Mme very effective technique in accomplishing this is the use of 
the vendor rating system. This involves the use of preaward surveys, 
as well as a post award running evaluation of vendor's quality effec- 
tiveness, It also includes the use of data feedback to the vendor and 
establishes a factual vendor quality rating which may be utilized for 
future business. I note that quite a number of contractors are utilis- 
ing such a system in some degree. The development of a well planned 
vendor rating system is not only an effective control device but per- 
mits more economic utilization of quality control manpower, This, of 
course, is in consonance with the Air Force Quality Control Specifica- 
tion 5923, This specification requires the contractor to have an 
effective and economic quality control system, and does include posi- 
tive quality assurance of subcontract or vendor parts. ‘The use of a 
vendor rating system, which requires that evidence of inspection, test 
and control of quality is furnished by the vendor or subcontractor with 
his product, can result in minimizing costly visits or assignmmt to 
the subcontractor's plant. I firmly believe that the subcontractor 
should provide proof of quality to the prime just as we look to the 
prime to show proof of quality to the Government, whenever he tenders 
materiel for acceptance, 


Cost Consciousness 





In the short period since returning from overseas, I have attended 
a number of meetings with various industry associations. In almost ev- 
ery instance, I have been impressed with the fact that cost of quality 
and cost of quality control are generally interwoven in most presenta- 
tions. On 1 March, the Dayton Section of the American Society for 
Quality Control, staged a very fine conference. The theme of the all- 
day meeting was, "Quality - Its Measurement and Cost." I notice that 
industries refinement of most quality control techniques include cost 
aspects, as well as the primary objective of improving effectiveness. 








This, I heartily endorse, as a taxpayer and also in my official capa- 
city, because positive cost controls highlight waste and losses due to 
failure to control quality. When these truths are revealed to manage- 
ment, quality gets back to standard fast. 


The Air Force Role 





Up to this point we have dwelled upon the contractor's obligation. 
What then does the Air Force consider as its part in the oversll qual- 
ity control effort? To begin with, by public law, it is our responsi- 
bility to assure that only material conforming to contractual require- 
ments is accepted for payment. This again is a very simplified state- 
ment of resvonsibility. Actually, the military part is no less complex 
and diverse than that outlined for industry. There is, however, a 
sharp distinction in the comparative extent to which the Air Force 
duplicates the activities of the contractor in determining accepta- 
bility of product, These are not only for economical or practical 
reasons, but is attributed to a very fundamental concept which became 
obvious a number of years ago. Very simply stated, quality and relia- 
bility must be designed and tuilt into the product. The only way that 
a contractor can assure that the desired quality is built in is thru 
very precise step by step examination, testing and control of every 
important operation in the production process. To accomplish this a 
substantial number of people possessing a great variety of specialized 
talents and skills are necessary. From the Air Force point of view 
then, the assurance by the Government can be achieved if the effective- 
ness of the contractor's quality control efforts can be measured by a 
systematic and objective surveillance and audit of the systems, pro- 
cedures and products, and the validity of quality data produced there- 
from properly verified. 


Why an Air Force Quality Control Specification 





It was this concept which motivated the development of the Air 
Force Quality Control Specification MIL-C-5923. This was necessary 
for two principle reasons: (ne, all Air Force planning, programming, 
training and staffing must be based on a single concept, and two, this 
concept would work only if specific contractual requirements for qual- 
ity control were established which would be commonly applicable to all 
contractors. After seven years of successful operation, this decision 
has proven beyond any question to be a correct one. With the advent 
of modern weapons systems and missiles and with what we expect to see 
in the near future, top management must exercise absolute controls over 
quality and reliability and assure that these essential characteristics 
are built into the components comprising the weapon system. Evidence 
that this is accomplished by methods subject to Air Force surveillance 
or audit facilitates acceptance decision making and results in minimum 
costs for "Government inspection." 


The reason then for a common effort and interest between the 
industry and military quality control organizations becomes obvious. 
It explains why the military is concerned with how the quality control 
manager does his job. It also explains why the military must specify, 
in general terms, what is required of a contractor in the way of an 
acceptable quality control system which assures equitable treatment 
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to all of its suppliers. It establishes the principle means by which 
the military discharges its responsibility to accept materiel that is 
in compliance with specified quality and reliability requirements of 
the contract. It protects the citizen, both as a taxpayer and as a 
competitor for Government business, 


We no longer can operate on the old customer-versus-seller con- 
cept. The quality control effort within the contractor's facility 
becomes a matter of joint concern and cooperative relationship. It 
is through these means and our participation in meetings with indus- 
trial organizations, such as American Society for Quality Control, 
Aircraft Industries Association, Electronic Industries Association 
and similar agencies, that we can move progressively forward in the 
development and improvement of these concepts and further cultivate 
the military-industry team approach. Since the Air Force views these 
advancements as a furtherance of its own objectives, it has and will 
continue to lend ful] support to such efforts. 


The Contractor Complies 





To get back then to the question, "What does the Air Force expect 
of contractors?" I must acknowledge that in many respects the expecta- 
tions have already been realized. We expect, however, that you, who 
are members of this great organization, which has contributed so much 
to the quality of military equipment, will accept the challenge before 
you now and progressively move forward in your efforts to assure even 
greater effectiveness in controlling quality and reliability, which 
are so vital to today's high performance aircraft, missiles, satellites 
and space vehicles. 


Reference: 


(1) Reliability is the probability of performing without failure 
a specified function under given conditions for a specified 
period of time (as defined by the Advisory Group on Relia- 
bility of Electronic Equipment in their report of June 
1957, titled "Reliability of Electronic Equipment"). 
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CONTINUOUS SAMPLING PLANS 


R. E. Biedenbender 
Quality Control 
Headquarters, Air Materiel Command 
Wright-Patterson Air Force Base, Ohio 


Today I would like to review some of the activity currently under 
way within the Department of Defense in the field of continuous 
sampling. This will include a brief review of the general nature of 
university research sponsored by the DD under a joint service contract, 
a brief discussion of DD contimous sampling handbooks currently being 
prepared, a detailed discussion of the Air Force Multi-level Contimous 
Attribute Sampling Plans, and a brief review of some applications of 
these plans to date. Particularly with reference to the contract re- 
search mentioned above, and to internal research and development under 
way within the services, this report cannot of necessity represent total 
DD activity in the contimous sampling field. It is hoped that it will 
adequately serve the purpose of giving some idea of the scope and nature 
of the interest and work in contimous sampling today in the DD. Simi- 
larly, these remarks cover DD activity, and do not cover activity in 
industry and universities not participating in the joint-service con- 
tract mentioned above. I will also assume that the reader is familiar 
with the circumstances in which contimous sampling plans are generally 
most useful. 


DD RESELRCH 

Time and space here do not permit a detailed discussion of research 
recently completed or under way, and necessitate assumption that, the 
members of the audience are familiar with the work of Dodge in con- 
tinuous sampling(}). Needless to say, of course, most of the work I 
will mention has generated from the early work of Dodge and Wald and 
pence ne Te Research completed to date has concentrated on the 
following areas of contimuous sampling: the effects of assuming or not 
assuming the process is in a state of statistical control, extension and 
modification of the Dodge single level procedures to include additional 
sampling levels, OC curve computation for contimucus attribute plans 

process stop rules, processes having specified degeneration 

characteristics, and assessment of plans by its income characteristics, 
rather than the A0Q, References 3, 4, 5, 6, 7, 8, 9, and 10, at the end 
of this paper contain detailed discussions of this work. 


Work presently under way includes investigation of the optimal 
level plan to choose when using milti-level attribute plans for finite 
production runs; examination of such miti-level plans without the 
assumption of the existence of the state of statistical control, and the 
effects of employing specific process stop rules; other various pro- 
duction models for continuous sampling on processes not in a state of 
statistical control; and the extension of continuous attribute sampling 
to continuous variables plans. Some work on continuous variables plans 
has already been done, independent of DD sponsorship(}) + 


DD HANDBOOKS 

Turning now to a discussion of DD Handbooks concerning continuous 
sampling presently undergoing development, these are two. The Army 
Ordnance Ammnition Center is presently preparing a handbook of single- 
level continuous attribute sampling plans, while the Air Force is 
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preparing one containing mlti-level contimuous attribute plans. Since 
this latter handbook is basically an adaption of the Air Force mamal 
(12) to be discussed in more detail during the remainder of this paper, 
I will confine the immediate discussion to the single-level handbook. 
This handbook will contain plans of two types, one following the Dodge 
and Dodge-Torrey, CSP-1 and CSP-2 procedures(]3), currently used in the 
Army Ordnance Handbook(14) and the second employing a decision rule 
which halts further inspection until corrective action has been taken on 
the process to remove sources of discrepant material, as currently used 
in the Navy Bureau of Ordnance Handbook(;5). Plans of this type are, 
in effect, a combination of the rejection number concept of attribute 
acceptance plans and the Dodge continuous plan concept. They were par- 
ticularly designed to eliminate the possibility of unlimited inspection 
by a purchaser, which would exist if the purchaser felt it necessary to 
perform the 100% inspection sequences if a straight Dodge procedure 
were utilized, The plans are described by the ameters N, the esti- 
mated number of items in a production interval (usually a day); i, the 
mumber of consecutive items inspected which mst be acceptable before 
sampling can begin; f, the sampling rate; and a, the maximm number of 
defectives allowed to occur during the production interval, before in- 
spection is halted. Note that i and f are identical to the parameters 
of the Dodge plan. Inspection begins during the production interval 
on a 100% basis. If "a" plus one defectives occur before i successive 
acceptable items are observed, inspection is halted until corrective 
action is taken, ~ If "a" or less defectives occur before i successive 
items are found, sampling at rate f is initiated. Upon occurrence of a 
defective, 100% inspection is reinstated. This is contimed until (a) 
i successive acceptable items are observed, or (b) a total of "a" plus 
one defectives in the production interval are observed. This brief 
description covers the most important concept embodied in these plans. 
Time does not permit discussion of other important possible facets of 
the plans, such as reduced and tightened inspection, mlti-station 
inspection, and action in the event of critical defects. 


Turning now to the Air Force miti-level contimous attribute 
mamal, currently being converted to a DD handbook, Air Force interest 
in such plans began when a plan of this type was implemented on the 
green-run teardown of newly procured aircraft engines, although the 
theory for such a plan was not fully developed at this time. This type 
of plan seemed to offer possibilities of decreasing the amount of in- 
spection or testing necessary when the quality history continues to show 
a@ process average mich better than the AOQ specified; it might also 
serve to soften the impact of possible fluctuations between 100 percent 
inspection and sampling, when only one sampling level is employed. 


The engine application above proved successful, and it appeared that 
such plans could also be useful in other expensive inspection or test 
areas. Consequently, when satisfactory solutions to the theory appeared 
in 1954(5)(16), the Air Force let a contract to Stanford University to 
prepare a mamual, containing a complete set of mlti-level plans, for use 
by personnel without sophisticated statistical backgrounds. 


In the development of this manual, Stanford University personnel 


visited a mumber of Air Force operated installations as well as DD 
contractors to collect data in order to evaluate the various possible 
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statistical models for maximm simplicity of administration and oper- 
ation. Actual service tests were conducted at an Air Force overhaul 
installation before the model was finalized. 


GENERAL DESCRIPTION 

AMC Mamal 74-23 sampling plans provide a procedure in which the 
fraction inspected, of the presented items, is decreased or increased 
by steps or levels, dependent upon the quality of the presented items. 
When application to a product is started, 100% of the product is inspec- 
ted until a specified number of consecutive items, i, is found free of 
defects, At that time, the inspection is reduced to a fraction, f, of 
the items presented. If i successively inspected items are found free 
of defects, the inspection rate is reduced to a new fraction, f*, of 
the items presented. This procedure is continved through the several 
levels for the plan. The use of a constant value of i at all levels of 
a particular plan simplifies administration, and lessens the possi- 
bility of inspection error in applying the procedure. Use of a geo- 
metric progression in sampling rate, permits different sampling 
frequencies for different characteristics without administrative burden, 
It also permits use of different AOQLs for major and minor character- 
istics. However, if at any level a defective item is found before i 
successively inspected items are passed, the plan provides a "spotti- 
ness" check to determine whether sampling should contime at the present 
rate or revert back to the previous sampling rate. 


The spottiness check is similar to the bs 7 of protection used 
against spottiness by the Dodge CSP-3 plans. (A very concise overall 
description of the plans would be an extension of Dodge's CSP-3 plans 
to additional ing levels.) An examination of the schematic 
diagram (Figure 1) will best give the reader an understanding of the 
procedure followed. Although this schematic shows only three levels, 
plans with from one to five levels are contained in the mamual. To 
obtain a schematic for a different mumber of levels, simply add or 
subtract to the middle levels. 


The plans are described by three parameters, i, k, and f. The 
number of successively inspected items that mst be found acceptable in 
order to shift to a smaller sampling rate is i. The mumber of sampling 
levels that may be used is given by k, and varies from one to five. 

The first sampling rate after leaving 100% is f, and each succeeding 
sampling rate is the same fraction raised to one larger power. Tims, 
for a five level plan, the inspection rates are 100%, f, f%, £3, £4, 

and f>. If f = 1/2, the successive rates are 100%, 1/2, 1/4, 1/8, 

1/16, and 1/32, If the mumber of levels is less than five, the last 
rate corresponds to the rate at the last level, i.e., if three levels as 
used, the last sampling rate is 1/8, F values of one-half and one-third 
are planned for the DD Handbook version. The method of risk indexing 
employed is the traditional AOQ@ (Average Outgoing Quality Limit). The 


underlying mathematical theory can be set up employing the Markhov 
chain process. 


RANGE OF PLANS 

Figure 2 shows the plans available using f equal to one-half, It 
should be noted that for the same AOQL each additional level requires an 
increase in the value of i. 


4ll 








AFI_AND AOQ CURVES 

The AFI and AOQ curves for the plans reveal some interesting char- 
acteristics of the plans. Figure 3 shows the AOQ curves for the same 
AOQL plan with different k values. Figure 4 shows the AFI curves for 
the same AOQL with different k values. For either very good or very bad 
quality with respect to the AOQL, the highest level plans are the mst 
efficient since they tend to either minimize or maximize inspection as 
the quality would logically dictate. This is, of course, consistent 
with general experience with lot-by-lot attribute sampling plans when 
considering single, double, or mitiple plans. 


i 


SELECTION OF A PLAN 

One of the problems historically, in contimous sampling plans, has 
been the logical selection of the i and f values for a plan after the 
AOQG@ has been set. In these plans, since f is already set, this prob- 
lem shifts to the question of what level plan should be chosen. This 
can easily be done when the production quantity is exceedingly large, 
since the AFI curves would then be good approximations. However, it is 
also obvious that for small finite quantities, more inspection could be 
required if, for example, a k equal two plan were used instead of a k 
equal one plan; this could be true even though quality was very good. 
Availability of good quality histories reduces this problem somewhat, 
since it might then be feasible to begin sampling at one of the lower 
levels. However, since extensive records are not always available, a 
better solution for selecting the optimum number of levels to use for a 
finite production run was desired. In studying this problem, it was 
decided to concentrate on solutions which would minimize inspection 
when quality was good. Theoretical studies resulted in equations of 
such complexity they had to be abandoned. Monte Carlo techniques were 
then employed, with satisfactory results. Figure 5 shows the average 
fraction inspected for the different k levels for increasing sizes of 
production runs. The incoming quality is assumed to be equal to the 
AOQL. 


Decision rules for level selection based on curves such as these 
are incorporated into a table in the manual (Figure 6), Only curves 
for f equal one-half and p equal to the AO@ are presently contained in 
the mamal. It is planned to include a similar table and curves for 
plans with f equal to one-third in the DD Handbook version. 


APPLICATIONS AND EXPERTENCE TO DATE 

Since the printing of the mammal in September 1956, a sizeable 
number of successful applications have been implemented within the Air 
Force. The size and nature of items included in these applications 
range from aircraft to EAM cards. They have also been used in con- 
trolling processes such as preservation and packaging. In the case of 
aircraft, of course, this is done only when a contractor has flown each 
individual aircraft, and has a good quality record. While it is nct 
feasible to go into great detail on these various applications, a brief 
description of one application to aircraft propellers will be given. As 
a result of a mmber of field complaints of failures of one type of pro- 
peller very early in their service life, an engineering decision was 
made to subject all such propellers to a 10 hour reliability test sim- 
lating early service operating conditions, Conferences between engin- 
eering and quality control personnel resulted in the authorization of a 
multi-level sampling plan in lieu of the proposed 100% testing. This 
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resulted in a decrease of $60,000 in the first year on the initial con- 
tract in which the procedure was incorporated. The plans have since 
been placed in other propeller contracts. 


The plans have proven quite useful in Air Force Depot maintenance 
overhaul and supply storage operations, both from a process as well as 
a product control viewpoint, since the nature of the plant layouts and 
the products overhauled are better suited for contimous, rather than 
lot-by-lot, sampling. The mmber of applications to relatively simple 
items has been surprising since it was originally believed that the 
administrative complexity would tend to result in concentration of 
applications in expensive test areas on the more complex type of product. 


A slight juggling of the plans' mathematical model has also re- 
sulted in a model for Defects-per-Unit continuous plans. Such a plan 
can easily be set up along traditional continuous attribute lines by 
defining an unacceptable unit to be one with more than some arbitrary 
mumber of defects, and assuming that defects follow the Poisson distri- 
bution(17, 18). These are currently undergoing service testing, and 
are thus not yet in finalized form, Such plans, just as the traditional 
C-chart, offer considerable flexibility in defining the unit of product, 
resulting in simplicity of administration and operation. One unexpected 
application to date, making use of this feature, is EAM card verifi- 
cation. The unit of product is defined to be 100 successive cards, 
Service test data verified that the error rate of better operators 
follows the Poisson distribution assumption. 


In summary, let me repeat that this discussion is not intended to 
cover all the activity within the DD in the field of contimous 
sampling, I+ is hoped, however, that this discussion has given a broad 
picture of the scope and nature of the interest in this field. 
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FIGURE 1 — FLOW PROCESS CHART FOR MULTI-LEVEL 
CONTINUOUS SAMPLING PLANS 











100% LEVEL 





Inspect 100% 
of the units in 
the order pro- 


duced. 


ACTION 1 


SAMPLING — 

























































































ACTION 2 


ACTION 7 


STATE IR 





Inspect next 4 




























































Inspect at [staves = {—— ACTIONS =—————-+ units following 
rate f = 1/2 ? the defective 
Resume in- 
: found. 
spection at rate 
f2=1/2 
ACTION 6 
ACTION 1 | 
ACTION 5 ACTION 8 
“it VEL NOL 2. ” ACTION 2 STATE 2R 
Inspect at STATE 2* —— ACTION3 —————_| Inspect next 4 
rate fd =1/4 units following bs 
Resume in- the defective 
spection at rate found. 
| ois 
‘ee ACTION 4 
ACTION 1 ] 
5 
ac teEN ACTION 8 
SAMPLING RATE —- oo 
LEVEL NO. 3 ACTION 2 STATE 3R 
Inspect next 4 
erases STATE 3* ACTION 3 aed adie 
rate f = 1/8 ; the defective 
Resume in- found. 
spection at rate 
= 1/8 ACTION 4 
ACTION 5 on 
ACTION 1 — If i inspected units are found ACTION 5 — If i-4 inspected units are 
to be free of defectives, shift found to be free of defectives, 
to next level. shift to next level. 
ACTION 2 — If a defective is found while ACTION 6 — If a defective is found after 
sampling at rate f*, shift to resumiag inspection at rate f = 
Seate KR. 1/2; shift to 100% inspection. 
ACTION 3 = If the 4 inspected units are ACTION 7 — If a defective is found among 
found to be free of defectives, the 4 inspected units, shift 
shit: to State K.* to 100% inspection. 
ACTION 4 — If a defective is found after ACTION 8 — If a defective is found among 


resuming inspection at rate 
f*, shift to State (K-1)R. 


the 4 inspected units, shift 
to State R. 

















FIGURE 2 ~ TABLE OF | VALUES FOR MULTI-LEVEL 
CONTINUOUS ATTRIBUTE PLANS 








Based on f = % 

AOQL k=1 k=2 k=3 k=4 k=5 
.20 - - é ee - 
15 - - - 4 5 
10 - 4 6 > 9 
075 - 6 9 ih 13 
050 5 1 15 18 20 
.040 7 14 19 22 25 
.030 ih 20 26 31 34 
.020 18 31 40 47 51 
015 25 43 55 63 69 
.010 39 65 83 95 104 
.0075 54 88 112 128 140 
.0050 82 133 168 193 210 | 
.0025 167 269 337 386 422 











k = number of sampling levels. 
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FIGURE 5 — ESTIMATE OF AVERAGE FRACTION INSPECTED FOR 
FINITE LOT SIZES, WHEN INCOMING QUALITY IS 
EQUAL TO THE AVERAGE OUTGOING QUALITY LIMIT 
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FIGURE 6 ~ MULTI-LEVEL CONTINUOUS ATTRIBUTE PLANS 
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If on entry into a block, an arrow is encountered, use the plan at the top 
or point of the arrow. 
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EVALUATION OF TUBE SCREENING PROCEDURES 
PART I - ENGINEERING EVALUATION 


Vinson M. Lockwood 
Hughes Aircraft Company* 


1.0 INTRODUCTION 


Faced with ever-increasing demands for more reliable equipment, many 
contractors engaged in the manufacture of military electronic equipment, 
particularly in the aviation and missile fields, have resorted to parts 
screening techniques as a means of enhancing the reliability of their 
products. To a large extent, this screening may be traced to the pre- 
sumption that future part performance in the equipment will be enhanced 
if the parts have previously conformed to these screening procedures. 


Insofar as electron tubes are concerned, considerable effort has 
been devoted to research, development and production refinement in order 
to improve the basic quality level of military tube types. However, 
equipment contractors paid little heed to the great strides made in the 
improvement of tube reliability over the past five years and continued to 
employ their rigid incoming inspection test procedures to screen indi- 
vidual defective tubes from purchased lots. 


As a result of this practice, incoming inspection shrinkage figures 
ran high, and the cost of "acceptable" tubes mounted. When asked for 
substantiating data to justify these practices, no individual contractor 
could supply any. Thus, the need for an evaluation of these screening 
procedures became apparent in order to justify the additional cost of 
electron tube procurement as well as to determine whether or not they 
should be included in military specifications. The military services, 
through the Bureau of Ships, requested that ARINC undertake such an 
evaluation program. 


2.0 SCOPE OF THE ARINC EVALUATION PROGRAM 


Three specific incoming inspection procedures were chosen as sub- 
jects for this evaluation. These particular procedures were represen- 
tative of tube screening procedures in general, in that they included 
tests involving visual (microscopic) inspection, X-ray examination, 
polariscopic examination, thermal shock testing, electrical inspection, 
and vibration testing. The three procedures will hereinafter be refer- 
red to as Procedures A, B, and C. 


2.1 Description of the Screening Procedures 





Procedure A consisted of a two-stage microscopic inspection and 
a swept frequency vibration noise test. The microscopic inspection cri- 
teria were derived from Bu0rd Specification MIL-E-17751; the vibration 
test was composed of peak-to-peak noise measurements under swept fre- 
quency excitation from 30 cycles per second to 1000 cycles per second at 
an acceleration of 15 g's. 





*Information presented in this paper is based on work performed by the 


author in the capacity of i~oject Engineer with Aeronautical Radio, Inc., 
prior to joining Hughes Aircraft Company. 
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Procedure B was primarily a thermal shock screening test. ‘This 
test was prec y a visual inspection for glass defects, as described 
in MIL-E-17751. ‘Then the tubes were subjected to three cycles of the 
MIL-E-1 thermal shock test. For those not familiar with the MIL-E-1 
“d@ink" test, it consists of placing room temperature tubes into boiling 
water (not lower than 97° C) for a minimm of fifteen seconds and then 
immediately transferring them to ice water (not higher than 5° C) for 
five seconds. This procedure included three such "dunkings", with a 
minimm of fifteen seconds between each "dunk". Forty-eight hours after 
this thermal cycling had been completed, the tubes were again inspected 
for glass defects in accordance with MIL-E-17751. 


Procedure C included visual inspection, X-ray examination, 
polariscopic examination, vibration testing, and electrical testing. 
Visual inspection was performed essentially in accordance with MIL-E- 
17751, and vibration and electrical tests were performed in accordance 
with the requirements of MI[-E-l applicable to the individual tube types. 
In addition to these standard tests, X-ray examination was employed to 
distinguish between metallic and non-metallic particles and to detect de- 
fective welds, and polariscopic examination was employed to detect exces- 
sive strain within the straight portion of the glass side walls. 


2.2 Description of Test Designs 





In evaluating each of these procedures, experiments were de- 
signed which had as their primary objective the comparison of screened 
tubes with unscreened tubes. Also included were studies pertaining to 
test repeatability, tube deterioration, correlation of test criteria with 
equipment performance requirements, NEL tube shield inserts, and improved 
tube types. Only those specific objectives pertaining to the subject at 
hand will be further pursued at this time. 


The sample sizes employed for evaluating each procedure were 
based upon performance estimates which had been observed beforehand as 
well as test sensitivity and test power. ‘Thus, in obtaining answers to 
the primary objective, the performance of two samples was monitored. 

The treated sample was composed of tubes which had passed the screening 
procedure. The control sample consisted of tubes selected at random be- 
forehand from the same lots but not subjected to the screening procedure; 
and in order to prevent any extraneous factors from influencing the test 
results, the principle of randomization was utilized in each evaluation. 
In addition, sample identity was unknown to the maintenance and test per- 
sonnel . 


2.3 Description of Equipment Environments and Performance 
Measurements 





For evaluating Procedure A, tubes from each sample were in- 
stalled and operated in a missile component which was subjected to elec- 
trical and vibration tests conducted in accordance with the component 
production test specification. Those tubes causing component non-compli- 
ance to test specifications, as evidenced by their removal from an as- 
signed socket by production personnel, were termed "removals". Based 
upon the total number of tubes installed within each sample, removal per- 
centages were calculated. 


For evaluating Procedure B, tubes from each sample were subjec- 
ted to simulated thermal environments which were representative of the 
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thermal environments encountered in production "potting" of electrical 
units. After a 48-hour holding period, electrical tests for plate cur- 
rent and grid current were employed to determine "removals". Here, as 
before, based upon the total number of tubes tested within each sample, 
removal percentages were derived. 


For evaluating Procedure C, tubes from each sample were in- 
stalled in each of two different systems. One system was made up of 
electronic units of the BomNav system used in B-47 type aircraft. The 
transceiver unit of the ARC-27 commmications system used in AD type air- 
craft was used as the other system. In the case of the former systen, 
all tubes removed from the equipment by military technicians in accor- 
dance with established maintenance procedures were termed failures. 
Based 6) the assumption that these failures were distributed exponen- 

tube failure rates were derived. In the case of the latter 
system, all units requiring maintenance action, i.e., adjustment or re- 
Placement of parts, were termed unit failures. Again, based upon the ex- 
ponential assumption, unit failure rates were derived. 


Of these four evaluative techniques, two were studies of ini- 
tial circuit or environment capability, since the time periods during 
which performance measurements were made included only the time neces- 
sary to ensure component compliance with production test specifications. 
The other two evaluations included deterioration studies, in that time on 
each tube or unit was monitored until failure occurred, as evidenced by 
tube replacement or unit maintenance action. 


3.0 RESULTS OF TESTS 


The results obtained in the ARINC evaluation program reveal that no 
apparent improvement in tube performance or equipment performance is ac- 
complished through the use of these tube screening procedures. 


The tube performance data for evaluating Procedure A which were ob- 
tained during production testing of a guided missile component is presen- 
ted in Figure 1. Using the sequential sampling technique, which is 
covered in the second part of this presentation, the statement that no 
difference exists that is 2 to 1 greater can be made with a Type II error 
of 0.05. You may recall that the treated sample for this evaluation was 
required to pass a rigorous microscopic inspection test and a swept fre- 
quency vibration test. Tube yields from this screening procedure ranged 
from as low as 10 percent to approximately 75 percent. 


The tube performance data for evaluating Procedure B is presented in 
Figures 2 and 3. One environment consisted of “dunking” tubes at room 
temperature into oil baths raised to different temperatures (cold-to-hot); 
the other consisted of elevating the temperature of the tubes in an oven 
and then "dunking" the tubes into a water bath at room temperature. Here, 
you may recall, the treated sample was required to withstand three cycles 
of the MIL-E-1 "dunk" test. Here again, analysis of the data fails to 
substantiate the need for the thermal shock screening procedure in that 
the control sample withstands the simulated thermal environments just as 
well as the treated sample. 


Figure & presents the tube performance data for evaluating Procedure 
C when measured in the environments associated with the BomNav system in- 
stalled in B-47 aircraft. In this case, you should bear in mind that the 
treated sample was required to successfully meet the requirements imposed 
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by visual, X-ray, polariscopic, electrical, and vibration testing; and as 
evidenced in the preceding evaluation, no improvement in tube performance 
was noted at a test power of .95 foral.3 tol ratio. Foral.ltol 
ratio, a Type II error of .15 would be involved if the statement that no 
difference exists is accepted as true. 


Figure 5 presents the unit performance data for evaluating Procedure 
C when measured in the environments associated with the communications 
system installed in AD type aircraft. This was another evaluation of 
Procedure C, but it employed a different “yardstick", so to speak. The 
effects of the tube screening procedure on tube performance were para- 
mount in the evaluation of the BomNav system, while the effects on unit 
maintenance were foremost in the evaluation using the communication sys- 
tem. Here again, no improvement was noted. This statement involves a 
test power of .95 for a ratio of 1.3 tol. 


4.0 CONCLUSIONS TO BE DRAWN FROM THE TESTS 


In brief summary, the four evaluations have demonstrated that no 
material improvement in tube or equipment performance has been attained 
through the use of these special tube screening procedures, in that the 
so-called "defective" product seems to enjoy the same probability of sur- 
vival as the so-called "good" product. Since the results are contrary to 
the predictions of the screening procedure, it appears logical that other 
factors must contribute to the degree of tube or equipment reliability 
inherent in any system. This will become more apparent after we ponder 
over the following considerations: 


4.1 Test Criteria 


For the most part, screening procedures judge the "goodness" of 
a@ product by criteria that are not necessarily related to the criteria by 
which a system judges the “goodness" of a product. For anyone with even 
the remotest experience in electronics, the following conversation be- 
tween the two individuals depicted in Figure 6 would appear commonplace: 


Bill: "But I tested this tube myself, and I know it's good!" 


Sad Sack: "I don't care who in the H--- tested the blankety- 
blank tube! It won't work!" 


4.2 Repeatability 


In some cases, repeatability of a test could allow "defective" 
product to be present in the so-called "good" product. An additional ob- 
jective included in the evaluation of Procedure A was to determine the 
repeatability of the visual inspection test. It was discovered that the 
agreement between the tests on individual tubes in a sample was of the 
order of 70 per cent, whereas the agreement between the tests on percent 
defective in the sample was of the order of 90 percent. Here the same 
operator performed the two tests; if the operator-to-operator repeat- 
ability is considered, agreement between tests on individual tubes in a 
sample would be even further reduced. If a particular test criterion is 
correlated with equipment performance requirements, this test repeat- 
ability on an individual tube classification should probably exceed 90 
per cent for it to be acceptable as a screening test. 
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Fig. 6 " . . « it won't work!" 


4.3 Test deterioration 





Another consideration that should not be passed over lightly is 
the possibility that the screening procedure may introduce deterioration 
which cancels out the improvement gained by the screening tests. Thus, 
useful life may be enhanced by the test criteria but, at the same time, 
be degraded by the testing and handling involved in the procedure. 


4.4 Sampling 


There is an old adage - "Quality must be built in... it can 
not be tested into a product." If the basic quality level of a produc- 
tion lot of tubes falls below the minimum acceptable level, it is un- 
likely that any amount of special processing by the tube user can screen 
out the defectives from the lot and thus test the desired quality level 
into the lot. ‘The very tests which have proven ineffective for screening 
purposes have been found effective in assessing lot quality level and 
thereby indicating to a certain extent the degree of reliability to be 
expected in any system. Tests of this type are not only important to the 
tube manufacturer for good quality control, but are equally useful to the 
tube user in lot acceptance on a sampling basis. Some equipment manufac- 
turers actually maintain a Vendor Rating System on the basis of such 
sampling plans. Economic pressure can then induce the tube manufacturer 
to better the quality level of his product beyond specification minimums. 
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5.0 A SUGGESTED APPROACH TO FUTURE SCREENING PROGRAMS 


In the final analysis, it has been the consistent experience of many 
investigators in the field of reliability that tolerance problems gener- 
ated by marginal circuit designs and basic misapplications of tubes far 
outweigh all other causes of equipment malfunction. For every equipment 
malfunction attributable to "inherent" tube defects, there are usually 
three to ten malfunctions which are either "induced" or "defined" by the 
application. It is doubtful that any screening procedure, no matter how 
elaborate, could offer a general, across-the-board solution to all toler- 
ance problems as well as to the inherent tube weaknesses. Infofar as 
future screening programs are concerned, it is suggested that the toler- 
ances imposed by system requirements be given primary consideration. 

Each circuit mist be carefully considered in relation to the requirements 
imposed by system design tolerances. Allowance must be made for initial 
variations in part parameters as well as variations in these parameters 
during operational use. Once these requirements have been established, 
they should be verified - first, at the bread-board stage; second, during 
limited prototype production; and finally, during production and opera- 
tional use. 


If at any point in the verification of these requirements a screen- 
ing procedure is indicated, it is further suggested that coincident with 
the adoption of such a procedure, steps be taken to evaluate its effec- 
tiveness. If proven effective, the procedure should then be incorporated 
into standard part specifications. If ineffective, it should be discon- 
tinued to conserve material and manpower. With this type of approach, 
based on sound engineering judgement and supported by invaluable statis- 
tical evidence, the path to a more reliable future should be brightened, 
indeed‘ 


(1) aRmc Monograph No. 1 








‘EVALUATION OF TUBE SCREENING PROCEDURES 
PART II. STATISTICAL ANALYSES OF DATA 


Edward Sax 
Aeronautical Radio, Inc. 


1.0 ANALYSIS OF PROCEDURE A (MICROSCOPIC EXAMINATION, NOISE TEST) 


1.1 Introduction to the Sequential Chi-Square Test 


Procedure A was evaluated by comparing the proportion of 
removals among the tubes accepted under Procedure A (P,) with the pro- 
portion of removals among tubes in the control sample not subjected to 
Procedure A (P,). This can be stated in the form of the statistical 
null hypothesia (H,) and alternative hypotheses (H,), as follows: 


H,: Pa = Pg= Py (any percent) (1) 


against the two-sided 
alternative hypotheses: 


6/2, Po = P, - 5/2 


or 

7 2? 8 (2) 
where 5 can be positive or negative. The expected frequencies for both 
hypotheses are given in their respective contingency tables, in Table l. 
(The "control" group is the group of tubes not subjected to Procedure A, 
while the "treated" group is the group accepted under Procedure A.) 























TABLE 1. CONTINGENCY TABLES: EXPECTED FREQUENCIES 
Expected Frequencies Group Breen a Band - Total 
Under the null Control nP, n(1 - Po) n 

h 
ypothesis Treated nP,, n(l - P,) n 
(My) 
Total 2nP,, 2n(1 - P,) 2n 
Under the alter- Control | n(P, - 6/2) | n(1 - P, + 6/2) n 
t hypot. , 
native hypotheses | treated | n(P, + 6/2) | n(l-P,-6/2) | n 
(M) 
Total 2nP,, 2n(1 - P,) 2n 
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From the contingency tables is derived the non-centrality 
parameter, X, of the non-central X* distribution:// 


-P,)  4P,(1-P,) 








2 2 
yey wea (2) Az) ae Ser 


1 ° 


When +e OQ, the null hypothesis is true, and the non-central X= becomes 
the X". 


To test the null hypothesis, the removal percentages Py and 
Py are compared by the chi-square test. If the observed X*exceeds its 
critical value with a prescribed probability (a, for example), then the 
null hypothesis is rejected -—- but with the understanding that a Type I 
error of size a could have occurred, since, when the null hypothesis is 
true, critical values of X*will occur a percent of the time. On the 
other hand, if the observed X“does not exceed its critical value, it 
may be concluded either that no difference exists between the two popu- 
lations, or that an existing difference was not detected, and a Type II 
error of size 6 occurred. 6 is the probability of failing to reject 
the null hypothesis when the alternative hypothesis is true. 


Since the experimenter faces the probability of making either 
type of error, he must decide, before the experiment, the size error of 
each type that he is willing to tolerate. When this is done, the sample 
size can be determined, to give the “power of the test" -- which is 
defined as the probability of detecting a difference (5) for a given 
Type I error. Its relation to the Type II error is: 


Power = 1 - Type II Error = 1 - 8. (4) 


The sample size for each group is calculated from Equation (3), when 6 
and Pp are given in the alternative hypotheses, and \ is that parameter 
of the non-central X* that will ensure the desired power. The values 

of \ are computed in Tables of Noncentral X* , by Evelyn Fix.2/ Since 
the required sample size may be large, a sequential test procedure may 
be beneficial. The sequential test is designed so that, for a given a, 
the null hypothesis will be accepted if(n) is less than, or equal to, 
B (N) -- where n is the sample size of the sequential test, and N is the 
Sample size of the non-sequential test that gives the desired power. If 
B (n) is greater than 6 (N) for any value of n, additional observations 
are taken. This procedure is presented graphically in Figure 1. When 
the null hypothesis is true, the X* value will fall in the "region for 
rejecting null hypothesis," a percent of the time; when it is false, the 
value will fall in the "acceptance region,” 6 percent of the time. If 
the X* value falls in the "region of indecision® (as it will, more than 
a@ percent of the time under the null hypothesis, and more than 6 percent 
of the time under the alternative hypothesis), additional observations 
are taken, and a X* test is made on the pooled data. 
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FIGURE 1. DECISION REGIONS FOR SEQUENTIAL CHI-SQUARE TEST 


1.2 Method of Determining the Critical Regions 





The critical region for rejecting the null hypothesis is de- 


termined by @ and the degrees of freedom of X*. In this experiment, 


Q= .05, and X? has one degree of freedom. Therefore, the critical 


region is a greater than 3.84. The critical region for accepting the 


null hypothesis is determined_by \ , -- found in Equation (3) -- and the 
critical X“-—— for example, Xq -- that satisfied the equation. 


x? 
B(n) = f ~  £(X, d, dk? = .05 (5) 
.e] 


The probability density is that of the non-central X? of one degree of 
freedom, with non-centrality parameter \ ,,. From its definition, 3/ 


f(X*, ) 4s derived easily. 








4 
, 1 - (X= p)?/2 
x* = x* where f(X) = e , ~O<X<@ 
V27 
xX = +x? a (6) 
dx? 
qx = 
ove? , 





Corresponding to a given X*, there are two values of X -- that is, + X. 


433 

















The sum of the two elements gives 





° 2 7 
(X22, )ax2 = — 9 V2 ol axe 


Von ok? 








Therefore, Equation (5) becomes 


oa 1/2 (7x2 Ps 


2 
a(n) = e = 


Von V2 x2 





oO 








E oa 1 e -1/2 (/x2- m ax? 
° Von oVx? 4 


Equation (8) is not in a convenient form for computations. In 
order to utilize the extensive Tables of Normal Probability Functio »V 
published by the National Bureau of Standards, the necessary transforma- 
tions were made. 


t = V xe *+u and v= {x2 - ae (9) 
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Then, Equation (8) becomes 


X,tH , -t?/2 
B(n) = —e 
i V2Qn 
= F(x,tu) - F(H) + F 


= F(X,tu) + F(X,-H) 





(10) 


- 1 





Since F(t) is the normal cumulative distribution function, F(t) and F(-t) 


are related in this manner: 

















F(t) + F(-t) = 1 (11 
Therefore, 
F(x +) + F(-(x,+#) ] 1 F(x, -#) + F[-(x,-#)] 
=-—- and =— 
2 2 . . 
so that 
B(n) = = {F(x tH) - F[-(x, +) ] + F(x,-#) - FL-(x,-») I} 
(x +H) <s (x -H) -y 2/2 +( 12) 
1 e 1 e 
=— J dt + : J dv 
- -(x +H) an -(x -n) {en 
Cc ot 
From Equation (9), 
xo = ang w= HTT (13) 
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From Equation (3), u =V/\ is computed; then, X_ is computed from 
Equation (13) for the limits of t and v that satisfy Equation (12). If 
v is negative, the limits of the integral should be reversed. The value 
of the integral becomes negative. 


1.3 Critical Regions of the Experiment 


In the experiment described in Part I, the statistical hypotheses 
(with a = B = .05) are: 


Ho: PQ = Pa = Py (any percent) 


Ha: (Pq = 4%, Pa = 2%) or (PQ = 2, Pa = 4%). 


The —_—— hypothesis implies that P, = 3% and 5 = 2%. From Equa- 
tion (3), 


4xP (1-P ) 
PR 2 ae 3,779 
52 


is the non-sequential sample size, based on A = 12.985, the parameter 
value for a power of 0.95. To obtain this size sample, 80 package runs 
were necessary; therefore, the sample size was increased to 3,840, to 
include all tubes from the 80 runs. 


The sequential tests were made every two package runs. The 
tubes in both groups were assigned to sockets at random, with balance 
maintained for each pair of test runs and, insofar as practical, for 
each test run. Since the control package contained 48 tubes, the sample 
size in each sequential test was a multiple of 9%. A, was computed 
from Equation (3) for the cumulative sample, and the boundary of the 
"region for accepting null hypothesis" was determined from Equation (13). 
Figure 2 presents the decision regions as shown in Figure 1, with the 
observed sequential X? values at the 32nd test run, where the decision 
to accept the null hypothesis was made. As the figure shows, the eighth 
X? value falls in the acceptance region. However, in accepting the 
null hypothesis, it is realized that the X* value would fall in the 
acceptance region five percent of the time if the alternative hypothesis 
were true — thus causing a wrong decision. 


1.4 The Interpretation of Zero Chi-Square Values 


In Figure 2, it may be seen that zero X* values occurred in 
the first, fourth, fifth, sixth, and seventh tests. This was because the 
number of removals in one of the groups was only one less than the number 
in the other group; and, due to the continuity correction factor, the 
difference between the observed and expected values became zero. 


For example, the first zero X* came from the contingency tables 
shown as Table 2. Since zero X* always falls in the "region for accept- 
ing null hypothesis," the inclination is to stop the experiment when this 
value occurs. However, the occurrence of a zero X* is attributable to 
the use of the X*, a continuous distribution, to approximate the hyper- 
geometric distribution, which is discrete. 
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° 1000 2000 3000 4000 5000 
SAMPLE SIZE (NUMBER OF TUBES) 
FIGURE 2. PROCEDURE A: SEQUENTIAL CHI-SQUARE VALUES FOR 32 TEST RUNS 
TABLE 2. CONTINGENCY TABLES: OBSERVED AND EXPECTED FREQUENCIES 
(|O-E] - 1/2)? 
x? jn > | | ) a ¢ 
E 
Tube Table of Observed Frequencies || Table of Expected Frequencies 
Group | Removed | Retained | Total Removed | Retained | Total 
Control 1 47 48 1.5 46 5 48 
Treated 2 46 48 1.5 46 5 48 
Total 3 93 96 3.0 93 .0 96 
TABLE 3. CONTINGENCY TABIE FOR EQUATION 14 
Tube Group Tubes Removed Tubes Retained Total 
Control r, n-Tr, n 
Treated r, n-T, n 
Total r 2n-r 2n 





437 











The exact probability of the contingency of a two-by-two table 
(see Table 3) based on the one-sided alternative hypothesis is: 


(rs) Pr (a-P,)"™™ (2) PF (- Po)? 


2 


iw : (14) 
Z (8) (et) OF Gr) EP ee)! 


r-r, 





where r, and r. are the number of removals from the control group and the 
treated group, * respectively; Pj and Pp are the probabilities of a removal 
from the control group and the treated group, respectively; n is the 
sample size for each group; and r = Pr, + Be If we let P, = P2, then 


(GG). GG) 
Zee) @ 





which is the general term of the hypergeometric distribution. 


When P, and Pp are small and n is large in Equation (14), the 
Poisson approximation can be used, and P becomes binominally distributed, 
as shown in Equation (15). 


if Pp, \/ Py 1 fr\ (1\% (2h 
: \r, ) P)+P., P)-P. r Sa (3) (3) 7 


for Pp = 2P). 





The probability of obtaining a value of zero x* — as illustrated in 
Figure 2 —- was determined from Equation (15); the results are given in 
Table 4. None of these zero values of X~ warrants stopping the test. 


For example, r = 3 in Test No. 1, and 
3\/1 2\* 3\/1\* /2\* 
Probability of zero x? = ( )(3) (2) + ( )(2) (2) = ,66. (16) 
1/\3/ \3 2/\3/ \3 

















TaBLE 4. PROBABILITY FOR THE ZERO X?'S SHOWN IN FIGURE 2, 
COMPUTED FROM EQUATION (15) 
P Cumulative Removals, by Tube Group 

fest ‘ Probabilit 
— Control Group | Treated Group (Two-Teil 
1 48 1 2 0.66 
4 192 7 6 0.20 
5 240 8 7 0.16 
6 288 8 9 0.14 
7 336 10 1 0.10 




















2.0 ANALYSIS OF PROCEDURE B (THERMAL-~SHOCK TEST) 


The arc sine transformation was applied to the failure percentages 
of the "cycle--tube type--manufacturer—environment" experimental unit. 
An analysis of variance was made, and it was found that cycles did not 
interact with any of the other factors. Cycles were then compared over 
the factors. The data are given in Table 5, with a table of expected 
values. X* = 2.51, which is less than the X“o5 for two degrees of free- 
dom. Therefore, there is no reason to reject the hypothesis that the 
three groups have the same removal rates. 

















TABLE 5. OBSERVED AND EXPECTED FREQUENCIES IN THERMAL-SHOCK TEST 
Table of Observed Values Table of Expected Values 

Cycles 
Failed | Accepted | Total Failed | Accepted | Total 
0 398 521 919 379 540 919 
1 345 518 863 355 508 863 
3 323 483 806 332 474 806 
Total 1,066 1,522 2,588 1,066 1,522 2,588 
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3.0 ANALYSIS OF PROCEDURE C (VISUAL INSPECTION, X-RAY EXAMINATION, 
POLARISCOPIC EXAMINATION, VIBRATION TESTING, EIECTRICAL TESTING ) 


3.1 Bombing/Navigation System 


The times-to-tube-removal were assumed to be exponentially dis- 
tributed. The purpose now is to show that, with this assumption, the 
statistical techniques used for testing hypotheses and determining Type I 
errors for the discrete case are applicable here. 


Replacements for removed tubes were not included in the compu- 
tations on this experiment. However, the sample size (approximately 
5,000 tubes for each group) was large enough, and the number of removals 
(about 100 for each group) was small enough, to justify the assumption 
that the number of removals during the observed total life (uw) followed 
a Poisson distribution with mean u/O. Then, the joint occurrence of r, 


and TL» given r, +r, is 


r.+r LL /@ Tt, bt ) Ts 
P(r,,r,/r,+r,) ” ( ; ( en 6 ( a ) 
2 


t 9 ' J / ’ 
\ a 1/4, tHe HW, /0,+u2/82 | 





The two samples were compared by the method described in Rao -/ (See 
Table 6.) 


(O-EH,)2 _ (1.9)? . (1.9)2 











2 
“ " = ,05945 
wed Ey 124.1 ~ 118.9 oo 
Oo (Not significant) 
X = .2348 
-E 
5 (FHo"FHa)? _ (5.8)? | (5.8)? _ coag 
E,, 124.1 118.9 
O° 
1) = .7443 


From Equation (12), Type II error = 0,147. 


3.2 Communications System 


The replacements for removed tubes 'were included in the compu- 
tations on this experiment. Since an exponential distribution of the 
times-to-removal was assumed, it follows that there was a Poisson distri- 
bution of the number of removals. (See Table 7.) 
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140.14 


139.86 


_ (18,26)*_ (18,26)? 





2.182 


140.14 


From Equation (12), Type II error = 


= 0.0494 


(Not significant) 





= 4,763 


139.86 


0.017. 





TABLE 6, 


PROCEDURE C -- BOMBING/NAVIGATION SYSTEM: 
OBSERVED TOTAL LIFE, OBSERVED REMOVALS, AND 
EXPECTED REMOVALS UNDER THE NULL AND ALTERNATIVE HYPOTHESES 









































Expected Removals (E,. ) 
Observed [Observed 4 
Tube Total Life |Removals 
Group Null Hypothesis | Alternative Hypothesis 
UL ae c ° : 
i | H,? 6, = G5 H,: & =1.1 & 
Control |} 1,676,710} 117 118 .9 124.7 
Treated || 1,749,343} 126 224.1 118 3 
TABLE 7, PROCEDURE C — COMMUNICATIONS SYSTEM: 


OBSERVED TOTAL LIFE, OBSERVED REMOVALS, AND 
EXPECTED REMOVALS UNDER THE NULL AND ALTERNATIVE HYPOTHESES 





Expected Removals ,,) 











Observed | Observed 
Tube Total Life | Removals 
Group Null Hypothesis | Alternative Hypothesis 
Ms Ti Hp: 9, = 0 H,: 0, = 1.3 0, 
Control || 670,701 138 139 .86 121.60 
Treated || 672,075 142 140.14 158 .40 
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WHAT DEPARTMENT OF THE ARMY EXPECTS OF CONTRACTS 


Colonel E. J. Gibson, Chief, Procurement Division 
Qfice of the Deputy Chief of Staff far Logistics 


M. Chairman, ladies and gentlemen. 


When Mr. Fales, your Program Chairman, requested Army to help in a 
session on what Department of Defense expected of contractors, we ac 
cepted with the intent of bringing you up to date on Army progress since 
a@ year ago when General Cassevant spoke at the National Convention in 
Detroit and to expand upon the presentation given by General Engler to 
your Society last fall in New York. Our chairman, Mr. Riardan, has just 
given a broad statement of Department of Defense policies. 


Before telling you of Army policy and how it affects contractors, 
we should explore differences between the various military departments. 
Air Farce as you know has held strongly to a program of almost complete 
dependence om their Specification MIL-Q-5923, while the Army and the 
Navy, except for Buder, have held out for inspections performed by 
Government personnel for purpose of determining acceptability. These 
differences stem from organization patterns and major differences in 
categories of items which make up the bulk of procurements of the 
departments. 


Like our sister services the Army, in the past few years, has 
rapidly been getting into complex, continuously changing weapons systems. 
As a result of these changes the three departments are shifting from 
widely divergent approaches to inspection and quality control to ome 
which, I believe, in the very near future will be a uniform approach. 


This does not mean that any department will adopt the single phil- 
osophy of any other department. Rather, it means that the best parts 
of the programs of all departments will be adopted, for situations 
where they fit best, by the other departments. 


Army has done this. Current Army inspectio and quality control 
policies and procedures are based on a belief in the ability of industry 
to manage the production and quality control effarts needed to deliver 
acceptable supplies and services. Basic Army policy was expressed by 
General Cassevant to your Society last May in his presentation at 
Detroit. In paraphrase he advised as follows. Contractors are respon- 
sible for controlling product quality and far offering to the military 
departments supplies and services considered by them to conform to con- 
tract requirements. Contractors should remain autonomous in the manage- 
ment of their affairs. Army is responsible for determining that con- 
tractors have successfully fulfilled their mission. General Engler, last 
fall, re-emphasized this belief of Army in the competence and integrity 
of industry. 


The revamped Army approach to inspection and quality control is set 
forth in Army Regulation 715-20, Army Regulation 715-60, the Army Pro- 
curement Procedures, and in a Logistics Directive 280-715. The essence 
of these regulations can be described simply as a requirement that 
each contractor perform, or arrange for performing, the inspections, 
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including testing, necessary to determine compliance with contract re- 
quirements. Army's methais of assuring acceptable supplies and services 
will be varied as to intensity, depending upon the success of industry 
in quality control and inspection. 





Getting more to the point on what is expected of industry, Army 
logistics Directive 280-715 and a revision of Army Procurement Procedures 
spell out the requirements and how they will be implemented. Suppliers 
will be required to perform examinations and tests set forth in specifi- 
cations so as to substantiate conformance of supplies to the specifica- 
tion requirements. In a few instances where suppliers cannot reasonably 
be expected to perform tests, the Government will so indicate in the 
specifications ar contract, and the Government will perform the tests. 
These requirements will be spelled out with appropriate contract lan- 
guage. In instances where specifications do nct have complete and 
definitive quality assurance provisions, those specifications will be 
supplemented in the contract to the extent necessary to clearly define 
what the contractor mst do regarding inspection and testing. Contrac- 
tors will be required to have available adequate test facilities, ar to 
make arrangements for the utilization of suitable test facilities. 


So far I have told yo mmly of what may seem to be additimal 
burdens om contractors. There is a corollary however. Army is re- 
emphasizing its policy that contractors in all instances should know 
what to expect in terms of requirements stemming from Government inspec- 
tion operations. And they should know in contractual form so they can 
firmly estimate costs generated due to demands of Government inspectors. 


These two seeming different missions - ame, what would be required 
of the contractor as a minimm in his inspectia to determine compliance 
with contract requirements, and, two, what may the Government do in 
assuring contract requirements have been complied with - are both 
covered very neatly by the same statements. These are the statements 
that appear in Section 4 of the Specification, or that should appear 
there. In cases where specifications require amplificatio, Army 
policy dictates that the contract shall amplify the specifications. 

This is in accardance with the policy I read for ym from a recent 
change to the Army procurement procedure. 


From your experience in using and producing to military specifica- 
tions you are aware that some of them do not have complete statements of 
required quality assurance procedures. Further, some of them will re- 
quire changes and amendments in the technical requirements. We are well 
underway in programs which will bring our specificatias into line with 
the new policies I have been discussing. This is being accomplished 
under two programs. During the last calendar year we trained better 
than 300 Army people in the principles and concepts underlying specifi- 
cation requirements, both technical and for quality assurance. Two 
main points are stressed. First, Army specifications will demand no 
higher quality in technical requirements statements than that necessary 
to accomplish the military mission. This policy is taught with a back- 
ground of the major points which set the stage for the level of techni- 
cal quality needed. These include: functioning, life, interchange- 
ability, weight where it is important, the necessity for conservation of 
critical materials, and other of the special circumstances which make 
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military equipment so different in some respects from your narmal 
commercial items. 


The second point has to do with contractors knowing what to expect 
in terms of requirements stemming from Government inspection operations. 
Army specification writers and engineers are taught in terms of specifi- 
cations establishing a limit om the maximum severity that the Government 
may impose through its inspection and quality assurance procedures. 


This last policy, even though an Army policy for some time, takes 
on added significance now that we are requiring camtractars to perform 
inspection in accordance with specification quality assurance procedures. 
To cope with the need far standardization in the way these procedures 
are established, Army is preparing an Inspection Engineering Handbook. 
This handbook, now in finished draft farm, is being coordinated through- 
out the Technical Services of the Army. The handbook defines the basic 
concepts of inspection engineering and provides the principles and prac- 
tices associated with quality assurance. It also covers all materials 
of immediate interest to inspection engineering organizations. Its 
contents include arganizatio and mission of inspection activities, the 
meaning of and activities embracing quality assurance, the development 
of quality assurance provisions, the preparation and maintenance of 
specificatios, equipment lists, standards, and handbooks, and feedback 
of information. 


Another document currently under preparation by Army is a textbook 
setting forth Army teaching on the principles and camcepts underlying 
specifications. Both of these documents will be available far sale 
through the Government Printing @fice some time during the last half of 
this calendar year. Some of you who work closely with the military 
system may be interested in gaining a better understanding of Army basic 
thinking on specifications. 


You may ask what do these changes in policy all add up to. The 
answer is simple and is almost a restatement of the creed of your Ameri- 
can Society for Quality Control. Since little can efficiently be done 
about the quality of product after it has been produced, then clearly, 
sorting type operations by military inspectors can only be wasteful. We 
mist push control of quality as far back in the history of items as we 
can. A first step, of course, is to put respmsibility back with the 
contractor. We do not intend nar are we going to stop with this step. 


We have initiated a campaign to do all we can within our persuasive 
powers to foster within industry greater use of quality control tech- 
niques throughout the process of producing or building equipment and 
facilities for the Army. This is the only way an effective industry- 
Army team can operate in this modern age. O course, it always has 
been the only way. But pressures in the past on our natioal economy 
have not been as severe, 


In organizing the Army-industry team to get maximum pride of crafts- 
manship in the production of military equipment, we are seriously con- 
sidering what may be an about face in policy. We feel that it may be 
desirable to allow, and possibly sometime in the future to require, all 
contractors to brand with their commercial trademark the military equip- 
ment they produce. This would bring home to all people in our productim 
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system the fact that their excellence, as craftsmen, will be om trial 
all through the life of the military equipment. This, of course, is 
somewhat controversial at the present time. For instance, how far into 
the componentry of an equipment will trademarking, and the use of mly 
military trademarked replacement components, be carried? 


Another question you may ask, since in the initial stages full- 
fledged quality control systems seem to be expensive to the producer, 
how about the inequities that will result when one producer installs and 
maintains a good system and his competitors do not. The military, and 
in fact the whole of the United States Government, has recently adopted 
the general policy-type answer to this question. This is in the form 
of a change to Standard Form 32, the General Provisions for Supply Cam- 
tract. Paragraph 5c of the standard provisions has been revised in one 
Place. I will quote: "The Government reserves the right to charge to 
the Contractor any additional cost of Government inspection and test 
when supplies are not ready at the time such inspectim and test is re- 
quested by the Contractor or when re-inspectim oar retest is necessi- 
tated by prior rejection." The revision is in the last ten words of the 
sentence. QO course, when a contractor does not maintain good quality 
control and inspection systems, his rejection rate is going to be higher. 
If, as stated in my quote, he is charged for reinspections or retests 
made necessary because of rejections, his competitive position will be 
in jeopardy. 


In telling ym about the change to Standard Farm 32 I used the 
words "general policy type." These were necessary since mech work and 
study will be required before we are able fully to implement the intent 
of this contract language. 


Everything I have had to say so far adds up to ome statement of 
mission. We are convinced the only way to effectively proiuce the 
equipment necessary for our military programs is to so motivate the 
producers that the supplies they produce will all be satisfactory. We 
cannot, with Government personnel or industry personnel, rely upon a 
sorting program after faulty work has been dame. Again I'll say, this 
of course is no more than the creed of your Society. 


The Army in striving far this motivation, which requires motivation 
on the part of Government personnel as well as contractar personnel, has 
first considered two broad categories of purchase=production systems. 
The first deals with items that may be clearly described and with which 
we have had experience in quality comtrol and inspections. The second 
deals with items complex in nature and so new that concrete descriptions 
and the details of quality control and inspection are beyond our reach. 


For the first category of items we will have clear-cut specifica- 
tions, in many cases describing design details, and in all cases, 
establishing the inspection procedures to be used, and will require in- 
dustry to perfarm the necessary inspections. In these cases, Army 
inspection will be reduced to a minimum as mae reasonable by the 
Quality of industry's operations. 


For the second category, we are striving for the same reductim in 
Army inspection but since we cannot establish the details of design, 
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nor of the inspectim procedures, we will describe contractually the 
quality controls in general form that contractors mst exercise. This 
will be in the form of a specificatim MII-G-14461, titled "General 
Quality Control Requirements." he of the reference documents in this 
Arny specification will be a Military Standard to be titled "Evaluation 
of Contractor Quality Cmtrol Systems." An interesting analogy may be 
drawn at this time. As I told you earlier, Army is camvinced that con- 
tractors in all instances should know what to expect in terms of 
pressures arising from Government inspection operations. This is true 
not only in inspection of supplies but also in determinations of can- 
tractors' compliance with quality control requirements. Thus, our 
military standard. 


We are publishing this evaluation procedure as a standard rather 
than an integral part of a specification in Section 4, because we will 
use the standard administratively to determine eligibility when can- 
tractors are producing simplier items for reduced Government inspection. 


In summation, the Army expects contractars on all types of equip- 
ment to perfarm the inspections and tests necessary to assure satis- 
factory equipment. Army will assist industry and expect the assistance 
of industry in developing more effective quality control programs far 
reduction of costs and conservation of critical man-hours and materials. 
I am sure that with the help of the members of this Society the effec- 
tiveness of quality control of Army equipment will be vastly improved 
during the next few years. 
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* COST REDUCTION THROUGH CONTROL GAGING 


Louis 0, Heinold, Jr, 
Federal Products Corporation 


We are all affected by changing fashions, Whether it is the bill 
for a new spring outfit, or whether it is the latest piece of precision 
equipment purchased for our shop, in one way or another we are affected 
by fashion, A few years ago the word of the day throughout industry was, 
"automation." Most advertising urged us to use products because of their 
ability to perform some small task in our own automation program, This 
urge to join the fashion vrarade came from manufacturers of hydraulics and 
pneumatic devices, chucks, pliers, screwdrivers, and virtually any com 
ponent which coulé conceivable play a small part in metal fabrication, 


Although "automation" is no lonper the word of the day, the basic 
concept which the word implies is still a dominant force in our high pro- 
duction economy, It is no longer fashionable to talk about automation 
because this is the day of missiles, Today's advertising must tie in 
with one or more of the many missiles now being manufactured in this 
country, Whether you are concerned with the manufacture of bearings, 
electronic components, packaging or office equipment, you must show legi- 
timate connection between your product and a missile when you sell it, 

A quick glance at any technical periodical will tell the story. 


One subject that is always fashionable is Cost Reduction, Today's 
economic pains make the subject more timely than hitching your product to 
a missile, And Cost Reduction is the subject of this discussion: How the 
average small lot producer in the metal working industry can take advant- 
age of some of the latest page techniques and show some honest to good- 
ness cost reduction at the same time, 


We have been seeing many chanrees in the application and use of rage 
equipment, Some of these are fundamental changes which started before 
the war, Naturally during this long period of time there have been many 
minor trends and side excursions in gage technique and Design, However by 
recopnizinge the basic trends it is now possible for the small lot produc- 
er to take advantage of some of the cost savings made available by these 
chanpes, 


Before we go further, let's define the small lot producer, We've 
been told that about 75% of those engaged in metal working in this coun- 
try have production quantities of 50 pieces or less, This means that any 
new Gage techniques which can show signifcant cost savings must have ap- 
peal to these small lot producers, 


There are two fundamental changes that have been occuring in the ap- 
plication of Gages since the War, and they both arise from the same gen- 
eral development, Specifically we are witnessing an important change in 
the time and place of inspection, This is the fundamental development of 
the past years which is changing our gage concepts and should providemore 
practical and economical inspection techniques to both the small and large 
lot producer, 


Traditionally inspection has always been a "post mortem" tool 
used to select the good and reject the defects, Today it is becoming a 
part of the manufacturing process, The time and place of gagine is chane 
ing, This basic change is two fold, We are now putting gages on or in 
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We are also vlacing them adjacent to or at 





the producing machine itself, 
the exit of a machine tool, 


The result of this trend is to do the gaging in the manufacturing 
process with the aim that the process can be halted or modified before 
defective parts are produced, This implies a control function and italso 
implies that there must be some process that is controllable, One imme- 
diately thinks in terms of fully automated production line with a closed 
loop feedback system in each step of the process, However, because this 
tyre of automatic equipment and control system is not useful to the small 
lot producer, this producer must go back one step further and take a look 
at the machines in his shop which are controllable either directly from 
gage information or indirectly from gage information given to a machine 
attendent, 


Now we all know that there are inherent variables in the production 
output of any particular type of machine tool, In-process gages are ex- 
pected to provide some kind of control over these variables, To do this 
it is necessary to understand the kinds of variables affecting the pro- 
cess, Generally speaking there are two types; piece to piece variation 
and drift variations, 


All machine tools, no matter how precisely made, have a certain 
amount of "shake" or "looseness", This is inherent because the manufac- 
turer has tolerances on the machine tool components, This "shake" or 
"looseness" will produce some variation in the output of the machine, 
Likewise, variation in the machine output will occur from deflection of 
the machine itself, Obviously, no matter how much mass or strength is 
built into the machine tool framework, the forces required by metal cut- 
ting cause deflection of the tool, This, in turn, contributes to varia- 
tion in the output, 


The product variation which occurs because of inherent "looseness" 
and deflection is called "piece to piece" variation or "scatter," 


When the output of a machine tool is statistically analyzed and a 
histogram of this output is drawn up, the "scatter" of the machine tool 
is represented by its 43 sigma limits, This is called a machine capabil- 
ity curve, It is the common desire of the machine tool builder and its 
user to keep the limits of this curve as narrow as possible. Scatter 
variations can be controlled by gaging the product while it is being pro- 
duced and using the gage information to periodically tell the producing 
machine the exact size of the part being produced, 


During a production run on any machine tool, there are numerous trend 
changes interacting to produce variations in the product, For example, 
ambient temperature tends to increase during the course of a summer day, 
Energy released as heat during the cutting action contributes toward 
product variation, These temperature changes have long-term effects, 

Some are addative, some nullify each other, At the same time other trend 
changes in the form of tool wear contribute toward product variation 


The net result of these changes are drift variations in the product, 
They are predictable and controllable by measurin g the product immedi- 
ately after it is produced and comparing it with preceding pieces, 


By moving the inspection function into the manufacturing process we 
now ask the gage to cycle the machine tool, analyze its output, accept the 
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good and reject defective parts. With gaging in this new rOle there has 
to be an economic pay-off, Obviously any gage equipment which is expect— 
ed to perform such a variety of duties is going to be more costly than 
our "final inspection" plug gare or dial snap, For any size producer 
this higher cost has to be justified, It might seem more difficult for 
the small lot producer so let's look at some of the gains available to 
hin, 


The easiest type of machine on which the small quantity manufacturer 
can control "scatter" is the plain grinder, This is done by providing 
gage equipment which can measure the product as metal is being removed, 
The gage information is then used to control the machine, Essentially, 
such a gage replaces a conventional time base cycle, provides closer con- 
trol, and a narrower scatter band, Illustration (15 shows a typical in- 
stallation, The conventional Arnold grinding gage mechanically engages 
the part, Motion of the plunger of this gage is transmitted to an Air- 
Probe, which in turn operates an Air Electric Switch, The signal from 
the Air Electric Switch is amplified by a simple Electronic Signal Unit 
whose relays are tied directly to the controls of the machine, In this 
way gage information ‘’s given directly to the machine tool, 


When control gages are supplied with new grinders, the machines are 
modified to have two rates of wheel infeed, The normal action of the 
grinder is rapid wheel advance until the wheel reaches the grinding posi- 
tion, Then wheel infeed proceeds at a fast rate until the control gage 
signals that the product is about .003" oversize, When this information 
is received by the grinder, it immediately switches to slow infeed, This 
rate of wheel advance is designed to replace the usual "tarry" which 
serves to produce good finish and true roundness, When finish size is 
"seen" by the control gage, the wheel head is immediately retracted, 


Machines which the small-lot producer has in his shop do not have 
these refinements, but he can still have all the advantages of the tech- 
nique, Normally, a grinder with a single rate of infeed has an adjusta- 
ble stop to limit the forward motion of the wheel, Without a control 
gage, this is set to stop the wheel when the part reaches finish size, 
The wheel is then permitted to "tarry" or "dwell" against this stop for a 
pre-set time, This "dwell" produces the characteristic "spark out" and 
assures the machine operator that the product will be round and have a 
good finish, 


With a control gage on such a machine, two rates of infeed are ob- 
tained by setting the stop at a point where the product will be .0002 - 
-0005" oversize, The timer is set for an abnormally long setting, The 
action of grinding requires substantial forces, These forces tend to de- 
flect the product and the machine tool, In this setup, the wheel comes 
in at the usual fast infeed rate until it hits the stop, The product and 
the machineée' are both deflected at this point. This deflection diminishes 
as soon as the forward motion of the grinding wheel is limited by the ma- 
chine stop, This diminishing deflection literally provides a second rate 
of infeed, The piece being ground tends to straighten and "spark out", 
It is rounded up and fine finish is achieved, Its diameter continues to 
be reduced so when finish size is reached, the gage sends the wheel head 
back, The timer now acts only as a safety device in the event of gage 
failure, 


This is indeed a good example of how post-mortem measurement can be 
removed from the inspection area and made a part of the production tool, 
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Coupled with a periodic correction for drift variations, parts machined 
in this fashion should not have to go through another inspection, Time 
and again, closer control of the product has been proven. 2n a machine 
whose machine capability is 60 = 70% of the product tolerance, uniform 
production in the order of less .0001" variation in size is frequently 
obtained, 


The Arnold Gauge is equipped with adjustable calipers so change over 
from one size to another is only a matter of a few minutes, This is of 
tremendous advantage to a small-lot producer, The fact that the operator 
does not have to be in attendance during the grinding cycle makes it pos- 
sible for him to supervise two or more machines simultaneously, Here is 
where real savines can be measured and skilled help freed for other 
duties, 
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SAVINGS (per piece)... .. . . *.02059 

RATE INCREASE (per shift). ......... 42 
TOTAL SAVINGS (per shift) —* 28.59 
EQUIPMENT COST AMORTIZED — 60 DAYS 


The chart in illustration #2 shows the tremendous savings which can be ob- 
tained by the use of grinding gages on adjacent machines, Here a small 
compressor crankshaft is ground on two center type plain grinders, Before 
the use of control gaging, two operators produced 1344 pieces in eight 
hours at a wmit cost of $,03585. Then both machines were equipped with 
the type of in-process grinding gage we have just described, Themechines 


_were physically repositioned so they could be attended by one operator, 


The new rate of production showed a slight increase to 1386 pieces, but 
the unit cost was slashed to less than half the old cost, The new unit 
cost is $,01526 and the equipment cost was amortized in 60 days! 


There are additional savings not even shown in this chart, For ex- 
ample the substantial gains in scrap reduction and product improvement 


haven't been included, 
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In another case an unexpected additional cost reduction was ob- 
tained. This machine, equipped with a control gage had poor hydraulic 
response, Because of this the first few pieces ground after the machine 
had been down overnight, or after it had been stopped for operator per- 
sonal time, tended to be slightly oversize, A piece removed from the 
grinder and found to be .0001" or .0002" oversize in this case would al- 
most always be scrap, Because there was no accurate way to advance the 
grinding wheel by such small increments, the removal of such smallamowts 
of stock made salvage impractical, However, by use of the Control Gage 
and observation of the meter hand only, the operator is able to salvage 
such parts and show some additional dollar savings for his employer, 


These are only two illustrations of how control of piece to piece 
variation can provide dollar savings to the user, In the average shop, 
@ little imagination will disclose other opportunities for similar con- 
trol, For example, one manufacturer obtained excellent control results 
by providing automatic cycling of his pivot polishing equipment, 


We have discussed piece to piece variation and how it can becontrol- 
led by a gage on the machine which observes the product as it is being 
produced, Similarly trend variations can be controlled by observing the 
product immediately after it is produced, Changes in the size of a pro- 
duct can be automatically analyzed statistically in the true quality 
control fashion, For example, gage equipment measuring the product 
directly after the producing machine can be set up with four limits of 
control, Two of these limits are the tolerance limits which sort the 
piece into "good" and "bad", The other two limits are the inner control 
limits sometimes called "approaching undersize" and "approaching over- 
size", These limits are energized only if a sequence of pieces crosses a 
limit, In some cases, gages of this kind are set up so the inner control 
limits are energized if two pieces in sequence cross this limit, 


The gage shown in illustration #3 provides this type of control of 
the trend variations of an internal grinder, Gage information is sent 
directly back to the machine tool which has been designed to accept this 
information and take corrective action upon its receipt, A machine tool 
like this has to be of recent origin and specifically designed tofunction 
in this manner, Most small lot producers don't have such equipment in 
their shops and new costly purchases might not easily be justified, How- 
ever it is possible for this manufacturer to make use of this technique 
without the costly controls, 


One of the most practical applications of gaging after the machine 
is shown in Fig, 4. This gage is designed to measure the product of a 
centerless grinder, For many years users of production centerless m- 
chines have recognized the advantage of measuring their product at the 
exit of the machine, Gagemakers have made numerous attempts to solve the 
problems of such measurement, A severe environment makes a complex me- 
chanical mechanism undersirable, Contact gages indicating product trend 
caused more problems than they solved, For example, on a machine remov- 
ing a small amount of stock, a contact gage would put a drag on the work- 
piece, backing it up between the wheels, This sometimes caused a wheel 
accident and always affected product finish, The presence of grinding 
grit made measurements doubtful and generally contact measurement has not 
been successful, 


An air gage is the most logical measuring system to use under these 
conditions, However, present day air gages have a relatively slow re- 
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sponse time, On a centerless erinder they can measure only pieces which 
have enough length to satisfy their long response time, Also, undercuts, 
holes, or spaces between parts compound the problem of using air gages, 


The gage in our illustration (4) meets the important requirement of 
non-contact gagine, It is a jetted air ring so there is no drag on the 
parts moving thru the machine. Most important, it makes use of an en- 
tirely new air system with a response time so fast that hich speed erind- 
ing of very short pieces is easily accomplished, In one case it is being 
used to measure a 3/8" long piece at the rate of 480 pieces per minute, 
Similar tooling in the form of adjustable air snaps are now available, 


But what of economic justification? Obviously this application is 
useful only for production quantities which might be considered optimm 
for a centerless crinder. ‘Jith "after-theemachine gaging" supervising 
the trend changes of centerless grinder, it is possible for one mechine 
attendant to be responsible for a larger number of machines than hereto- 
fore. If the current practice is to have one attendant per machine, the 
use of control parsing can change this to a two to one ratio, If the pres- 
ent ratio is two machines per attendant, it can now be three or four ma- 
chines per man, For those manufacturers who use only one such machine in 
production, substantial savings can be made by pairing it with a center 
type machine equipped with control gaging, One machine attendant merates 
both units, We could easily show additional substantial savings just by 
picturing the "old method" compared to this new technique, In a shop 
where all products go through final inspection, such a department has 
personnel, floor space, and equipment, At the same time, setup gages and 
possibly the services of inspection personnel are needed on the floor 
periodically If a quality control approach is used, it still involves 
personnel, equipment, and the need for floor space, These are some of 
the things which are drastically reduced when the inspection function is 
moved to the producing machine, Add to this the savings made by scrap 
reduction, and you most certainly more than meet the basic requirement 
of such changes: COST REDUCTION, 


The internal grinder equipped with a control gage was described 
earlier, This was an automatic installation and probably too elaborate 
for the average small-lot producer, To take advantage of the gains ob- 
tained with this technique and the same kind of technique applied to the 
centerless grinder, it is not necessary to have an automatic process, 
With a little imarination, it is very easy to see that if the inspection 
function is physically moved from some remote section of the shop direct- 
ly into the production line, gage information can be used to adjust and 
control the producing machines, Virtually, any well-designed gage, if 
placed adjacent to the machine tool, can perform these functions, Inner 
control limits can be marked on the dial and machine correction made when 
a sequence of parts crosses one of these limits, It is most important 
that the machine operator be educated in the new technique, It is point- 
less to have him go through the motions of measurement without wderstand- 
ing its significance, He must be made a partner in the new setup, His 
cooperation and understanding is fundamental to its success, To a large 
extent, final inspection and its extra costs can be safely eliminated, 


We have described today's new role for gages, The gages themselves 
have to be desiened differently to meet these changed conditions, Literal- 
ly, the buyer tells the gagemaker, "We expect to use your gage in very 
severe environmental conditions, It will not have constant attention, 
When it does have attention, this will probably be given by those least 
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FIG 4 = Centerless grinding gage - continuously mea- 
sures work at exit of machine, ignoring grooves, 
spaces between pieces, etc, and warns operator when 
size trends near tolerance limits. 








skilled in its use, We expect it to run for very long periods of time 
without drift and without resetting. We want it to perform its tradi- 
tional function of separating the "good" from the "bad" and at the same 
time help to automatically adjust producing machines." This is indeed a 
tall order for the gagemaker, He can meet the requirements, because he 
already has recognized the need for new devices, new techniques, and new 
systems of measurement, But what about the buyer? Has he recognized 
the need for new standards? 


The small-lot producer and the large-lot producer too have fallen 
behind in the recognition of these new requirements, The self-styled 
Ecuipment Engineer who has been filling his plant with traditional 
equipment is faced with a new problem, and in many cases, has done little 
to prepare himself for it, Traditionally, Equipment Engineers, Gage En- 
gineers, and others have selected gage equipment on the basis of features 
which were acceptable for a bench gage, constantly attended and easily 
replaced, Great emphasis has been placed on "ease of reading, dead-stop 
hands, color, shape", These are NOT the standards for control gage se- 
lection, They are being used far too frequently today, They have no 
place in the standards which the modern-day Gage Engineer should set for 
himself when selecting Control Gaging, Today, the gage is intimately 
tied in with the machine tool, gage down time means machine down time, 


If the Gage Engineer or whoever is responsible for new gage selec- 
tion is to obtain the gains which we have described earlier, he must be 
far more careful in his choice of measuring equipment than in the past, 
The gages are being asked to do a lot more, It is up to the Gage En- 
gineer to be sure they meet the new and higher standard, 


If the gage maker were asked to define what this new standard 
should be, there would be general agreement that the buyer should select 
on the basis of the "best system", This means he has to look past its 
cover plate and learn something about the measuring system itself, its 
capabilities, its limitations, and what he might expect from it, 


If the gage maker were asked to summarize in one word all the rules 
which should apply to this new kind of gage selection, again there would 
be arereement that the one word is "Reliability". Now this, too, is be- 
coming somewhat of a catch word, a "word of the day", so it needs a lit- 
tle explanation to see how the overall needs of a Gage Engineer can be 
defined as "Reliability" 


We have said that such things as bench space, color, speed of hands, 
etc, are standards of selection no longer applicable to today's needs, 
To arrive at a new standard, the Gage Engineer should select equipment 
with a check list of his needs in mind, This check list can be his 
definition of reliability. It can help him decide the "best system" for 
the job, 


let's take a look at some of the questions that should be on the 
Gage Engineer's check list: 


1, IS THE M@ASURING SYSTEM STABIE? Does the air or elec- 
trical system have the inherent capability of drift- 
free operation, Many of today's air systems drift 
quickly from inital settings. This might have been 
acceptable at a bench setup with an operator in constant 
attention, but such systems make inadequate control 


458 








le 


es 


ct 


os 
1d 





gages, The new gage must be able to hold its setting 
without drift for long periods of time, There is 
enough drift in the average machining process without 
buying more in the form of a gage! As an example, the 
Gage Engineer now has to analyze and compare, let us 
say, the old-style series air gage and the newer de- 
signed balanced air circuit, He has to study electrical 
stability and decide if he wants a high current system 
with its possible arcing and pitting of gage switch 
contacts or a low current system whose initial cost 
might be higher, but which would eliminate these unde- 
sirable teatures. 


IS IT A SIMPLE DESIGN OR SYSTEM? Numerous and complex 
adjustments are difficult to make on a gage which has 
now become a somewhat inaccessible machine tool com 
ponent, Mechanical adjustments should be few in number, 
easily reached, and lock positively, Zero setting of 
the measuring system should be no more difficult than 
setting your watch, Multiple adjustments for zeroing 
involving extra masters (more than one) for setting 
calibration and zero simltaneously are time consuming 
and deserve no place in today's control gaging. 





DOES IT HAVE THE REQUIRED SENSITIVITY? The traditional 
rule of thumb is that a gage should repeat to 10% of 
product tolerance, Product tolerances on centerless 
and center type grinding are often .0001 - ,0002", 
Control equipment adjustable for various sizes has to 
be chosen to meet the requirements of the closest toler- 
ance, In this case, the entire gage must be sensitive 
to ,000010", On a test piece under the same conditions 
each time, it must faithfully reproduce its reading to 
this accuracy, The Gage Engineer should make this re- 
quirement a part of his purchase specifications, 





IS_IT STURDY? The supporting gage structure should be 
designed to be consistent with its required sensitivity. 
It should be strong, but without great mass or bulk, 
The measuring system itself should be examined for 
sturdiness, In this case, our definition requires 

the system be studied for freedom from friction, 
hysteresis and other undesirable characteristics of 
old-style gages, The entire unit must be examined 
for its ability to operate successfully under severe 
conditions in the presence of water, grit, and crud, 
Check the gage for seals designed to prevent contami- 
nation from entering precision bearings and other 
important parts, 


IS IT A COMPLETE PACKAGE? Frequently proposals leave 
out units like air regulators, filters and traps, 
These are an important part of the proposed gave unit, 
which unreliable vendors later tell the customer he 

is expected to furnish himself, Other misleading pro- 
posals omit the fact that the machine solenoids cannot 
be tied directly to the gage switch contacts or gage 
relays, Additional electronic or electrical circuitry 
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is required, This means more engineering and addi- 
tional purchases for the gage buyer, He has been 
sold only gage components presumably at a lower 
price under the guise of a complete package, The 
Gage Engineer should specify that the Control Equip- 
ment is complete and ready to tie in to his machine 
tool without additional re-engineering or purchases, 


6, HOW ABOUT SERVICE AND MAINTENANCE? Obviously, a gage 
which meets the first five tests is not going to re- 
quire frequent maintenance, Nevertheless, the unex- 
pected may happen so the Equipment Engineer should be 
sure Service Engineering help for the proposed gage 
is easily available, The design itself should be 
checked for easy interchangeability of standard com- 
ponents, A stock of recommended spare parts should 
be purchased with the gage, 





There are other important details which the individual engineer 
might want to put on his own check list, The important thing is to 
recognize that this is a new field of gage application, It requires dif- 
ferent gage techniques and the ability on the part of the Gage Engineer 
to scrutinize design, air circuitry, electrical circuitry with complete 
detachment from his past prejudices, This isn't as difficult as it 
sounds because most reputable gage manufacturers are very willing to 
furnish factual advice about their equipment, This, together with a lit- 
tle courage and common sense, will enable the Gage Engineer to make these 

of Reliability a fe} s chase ifications, He will no 
longer be dependent upon the salesman's glib tongue or misleading ad- 
vertising. The frosting will be gone, but he will know that he has 
taken a big step in obtaining the gains we have described, He can assure 
his management that the end result of his gage selection will be: COST 
REDUCTION, 
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QUALITY CONTROL IN THE ASSEMBLY AND MANUFACTURING INDUSTRY 
(Abstract) 
August B. Mundel 
Director of Engineering 
Sonotone Corporation 


Because a factory makes product which human beings use the product 
must alweys be designed and manufactured so as to have the greatest 
appeal to the end market. The same is true of a management procedure. 
Quality control is a procedure used by human beings to achieve a result 
that human beings want. We have a wide range of training emong the 
workers and the suvervisors in any industrial establishment. The best 
results are usually obtained when an operator is given tools to fit his 
skill and a task within his capabilities but sufficiently difficult to 
give him some pride in performing the task. 


Our quality control jobs are done by people with all types of 
training and skills. It is a management job to fit the skill and the 
task. As a result, we often have similar jobs being done in different 

fa fashions in different operations. 


My discussion will center about some application of quality control 
procedures that have occurred in our operations and to examine them for 
the salient points. This is not a discussion of new ideas nor is it the 


ite usual application story. It is a clinical discussion of the methods 
2 used to achieve a definite end. We have decided to take a group of 
_ these applications and discuss them to determine whether they are sound 


or whether they offer further advantages over systems and procedures 
more commonly used. 


T Our quality control organization consists of four sections: 


1. Incoming Materials Control. 

2. In Process Control. 

3. Outgoing Materials Control 

4. Experimental and Advisory Service, serving Manufacturing, 
Research, Enginsering and Quality Control. 


The four activities cover a similar range of jobs to those 
described by Dr. Feigenbaum even though the descriptions are not 
idential. Dr. Feigenbaum lists his four sections es: 


a. New Design Control. 

b. Incoming Material Control, 
c. Product Control, 

d. Special Process Studies. 


We will use descriptions employed at Sonotone since it forms the 
basis for the problems to he discussed. The clinical problems are 
associated with the four divisions. They are: 


I(a) The choice of a suitable value for the Acceptable Quality 
Level, a problem common to incoming materials as well as process and 
outgoing materials control. 

I(b) A discussion of alternate methods of accepting a lot of 
material coming in in a stratified form. 
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II, An evaluation of process controls where many similar lines are 
operated or similar procedures are in use. Can the process all result 
in similar shrinkage values or similar standard deviations. 


III, Methods of maintaining process averages at outgoing material 
control. In some instances, we cannot afford the consequences of going 
to tightened inspection and must be most careful to avoid a poor process 
average. When the process average approaches an out of control condi- 
tion, what procedure ought to be adopted. 


Iv. A description of some plant experimentation designed to 
improve quality of the product or reduce shrinkage (the fraction of 
product not in accordance with specification). 


I(a) The Economical Quality Level (EQL) 


If it costs more to do an inspection operation than can be saved, 
it is economically unsound to do 100% inspection. Our quality control 
group must determine what quality level is acceptable on an economic 
basis. This is done on incoming materials and on in-plant transfers 
by equating the cost of 100% inspection to the savings made by finding 
and removing the defects. Unfortunately, the economics are not so 
easily determined on final product. We reach a break even point when 
the savings equal the cost incurred. Under these conditions, 


B 
p af +E+ 7s Economical Quality Level, where p = fraction defective 
at the break even point. The inspection of 100 pieces will yield 
p x 100 defectives. 


The expense is B = the cost of inspection a unit; $100 B is cost of in- 
specting 100 units. The possible savings are the values A, E and F 
individually in pairs or collectively as may apply where A = the cost 
of a unit which we recover when we return a defective unit to the 
vendor. E = the cost incurred in removing or correcting the defective 
if it is passed through and found at the next inspection. Sometimes 
much larger than the cost of the piece, sometimes trivial. F = the 
cost of processing the part to the next inspection. A waste if we 
process a defective. 


If the inspection cost exceeds the saving made by removing defects 
100 x B> p x 100 (A + E + F) we should not do 100% inspection. If the 
inspection cost is less than the saving made by removing defects 
100 x B< p x 100 (A + E + F) we should do 100% inspection. The break 
even point occurs when 100 B = p«100 (A +E+F) or p= ) 
+E+F 
Figure 1 is our standard form to encourage and simplify this computation. 


I(b) The Inspection of Material Received in a Stratified Form 


A shipment of small tube micas is received. The lot contains 
28,000 pieces. The lot is in a number of small boxes a1] packed in a 
larger carton. Each small box has the part number, quantity, production 
date, and the number of the die used to manufacture the pieces enclosed. 
In accordance with good sampling procedures, it is the practice of the 
incoming materials group to sort the material into rational subgroups. 
These consist of boxes made on the same die and on the same date. The 
product of two dies is always treated as two distinct lots. If the dates 
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of manufacture are rot continuous, the product of two or more dates is 
also treated as separate lots. After making the separation by dates and 
dies, each group is treated as follows: A single sample of size dic- 
tated by MIL STD 105 is selected and inspected on an attributes basis 
for missing holes and obvious defects. The material thickness, a 
critical dimension, is measured and plotted using approximately 100 
pieces from the same sample. From this we compute the grand average and 
the standard deviation to estimate the nossible percentage out of 
tolerance in thickness. In the past we would check thickness by select- 
ing ten small boxes and checking ten units from each box. If the ten 
values of X (ave) and R ranges showed control and were within specifica- 
tion, we accented the lot as within specification. The grid hole sizes 
are next checked on 25 pieces and 6 pieces ere checked for hole align- 
ment. These acceptances are made on the basis of the average value. 


Why should we check a) a large sample for attributes, b) a smaller 
sample for thickness, c) a still smaller one for grid hole size, and d) 
an even smaller one for alignment? A good bit can be said for a and d. 
The attributes sample indicates that the punch and die set is operating 
and if it is operating consistently it is not likely that a systematic 
shift in hole location can occur, Hence d must be invariant if the 
holes are present. Why should two variable b and c use such vastly 
different sample size? What is the reletive efficiency or advantage of 
the two methods of acceptance of the mica thickness? 


III The Control of Outgoing Product Process Average 


The product quality record is reviewed on the basis of the last ten 
lots submitted. The product has been accepted on the basis cf normal 
sampling. Should the process average exceed the upper limit of the 
process average for the appropriate AQL, we must go to tightened in- 
spection and incur other penalties. This we wish to avoid. 


The acceptance has been on the basis of MIL STD 105 single sampling. 
Lot size 801 to 1300, sample code letter K, sample size 110, AQL 2.5%. 
The acceptance number is 6. The rejection number is 7. The last 9 lots 
have accumulated 39 rejected pieces. The tenth lot is about to be sub- 
mitted. We examined the upper limits of process average (Table II 
MIL STD 105A) and find for a sample of 1100 pieces (sample of 110 each 
from 10 lots) the upper permissible process average for a 2.5% AQL is 
3.87% or 42 pieces. We do not want to find 4 or more defectives in the 
next sample of 110. Since our last two lots have had 5 and 6 rejects in 
the sample, we suspect our process has drifted to an undesirable level. 
How shall we treat the lot to make sure that when it is sampled the 
number of rejects is less than 3? Our procedure must obviously be to 
100% inspect to tightened limits and then sample the lot to assure our- 
selves that when inspected by the quality assurance group the lot will 
yield 3 or less rejects. Is it sufficient to go to tightened sampling 
inspection or must we be even stricter in this instance? If we must be 
stricter, how do we accomplish this within the confines of the MIL STD 
105 tables. We must assure ourselves by some sampling procedure that 
the lot when sampled will have a high probability of yielding less than 
3 defects in the 110 units checked. Suppose we decide that we can risk 
a .05 probability of rejection, This, in turn, implies that we must be 
95% certain that the lot contains no more than 3 out of 110 defectives. 
In terms of the consumer risks of .05 probability of acceptance this 
agrees closely with the 0.10% AQL for the same sample code letter K, 
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size 110 pieces from the curves (Figure 2) of MIL STD 105A. According 
to the samvling plan for an AQL 0.1 and the same lot size 801 - 1300, 
the sample size 110, the acceptance number is 0 and the rejection number 
is 1. This checks well with the Poisson. According to the Poisson, 
when p = 3/110 (the average fraction defective) and a sample size 

n = 110 is used, the product (pn) = 3.0. For this sampling plan,using 
an acceptance number of 0, the probability of accepting a lot of this 
quality is .05. Only once in 20 times will we accept a lot poor enough 
to actually have 3 defects in a sample of 110 on the average, 


In effect, we have applied MIL STD 105 and have gone far beyond the 
tightened inspection procedure to achieve a closer control of a specific 
lot on a lot tolerance % defective basis for a specific purpose. 


IV Experimental Designs 


The experimental and advisory service works with research, engineer- 
ing, production and inspection groups, assisting in the analysis of date. 
Preferably, this group assists in setting up the experiments in order to 
assure the results. 


An example of poor experimental design is represented by the work 
of one engineer who was advised that the customer reported 10% of the 
product defective. He was quite certain that the cause of these defects 
had been eliminated but just to give himself greater assurance he 
checked a sample of three units and found them satisfactory on an 
attributes basis. He and his group were chagrined at the possible 
error after it was demonstrated that the probability of selecting three 
satisfactory units from a universe containing 10% defective was .92 or 
73. Advisory service before the exveriment avoids this type of error. 


An example of a factory experiment designed to evaluate the causes 
of defects in a production tise was reported in Inspection Data and A 
Trouble Shooting Expedition(!), A combination of two levels of five 
variables was explored to determine the contribution to fraction de- 
fective of assemblers, assembly jigs, adjusters, adjusting jigs and 
final inspectors. The experimental design is shown in Figure 3. Forty 
units were processed through each numbered box. Each part of the ex- 
periment and each move of product down the line was planned before the 
experiment was started. Upon completion of the experiment, the work 
was all collected and held while the analysis of the experiment was 
done in @ most simplified manner to provide an immediate answer as to 
where the difficulties were and to help educate our personnel on true 
experimental procedure. 


The important parts of the program are 1) team effort to complete 
a task, 2) recognition and use of the advantages of statistical 
knowledge to identify areas of difficulty in contradistinction to the 
earlier example of the misguided engineer's use of too small a sample. 


(1) George Scheel, Sonotone Corporation, ASQC Middle Atlantic Regional 
Conference, February 28 - March 1, 1958, 
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Control Charts Without Calculations 
Some MOdifications and Some Extensions 


raul C, Clifrord 
State Teachers College 
Montclair, N. J. 


Abstract. The usual control chart for variables requires a separate 
data sheet, a reasonable amount of arithmetic, and two charts. The 
computation of control limits frequently requires the elimination of 
some of the original data. And for shop personnel there is still 
confusion between control limits and specitication limits. aA method 
is presented in which individual measurements are plotted, thus 
eliminating the data sheet. Control limits are established by a 
process of measuring and counting. Such charts lend themselves to 
a variety of applications, some of which are considered. In parti- 
cular this procedure gives a simple comparison of process capability 
and process achievement. 


The Industrial Efficiency of Control Procedures. Control charts 
using the range have almost completely replaced charts using the 
standard deviation. Une reason is simplicity of computation. A 
second is that the range is a better indicator of certain sorts of 
trouble. Considering the industrial usefulness of the range, it is 
surprising that substitute measures for the average-range chart have 
received so little consideration. ferrell in an exceilent paper (1) 
has presented the advantages of the mid-range. Much of the theory 
in Ferrell's article is pertinent to the charts presented in this 
paper. ‘the Narrow-Limit gaging procedure of Utt and Mundel (2) and 
the rre-Control procedures developed by Shainin and Satterthwaite (3) 
are examples of plans that are based on the characteristic of an 
individual item rather than on some statistic computed from these 
individuals, 





There is a wide variety of control problems for which the quality 
control engineer needs an extensive kit of techniques. Operationally* 
the problems run from first piece inspection and equipment set-up 
inspection to multiple factor experiments designed to discover well 
concealed causes of trouble, Statistically we have a battery of tools 
ranging from distribution analysis and control charts to regression 
analysis, analysis of variance, surface response estimation and so 
forth. In evaiuating any industrial procedure 4 would suggest that 
there are at least four aspects: engineering, economic, statistical, 
psychological. Just because the statistical aspect is the easiest to 
evaluate, we tend to overvalue the statistical etficiency and lose 
sight of the other aspects. 





Thus the charts that I will present have advantages, but in 
general they are not based on arguments regarding statistical effi- 
ciency. Rather this is a report on several years industrial experience 
which has confirmed my intuitive belief that such charts have a wide 
area of application. 1n general this area is definable as "rroblems 
or processes that require more than first piece inspection and less 
than three factor designed experiments." I do not propose that these 
should replace X = K charts, My belief is that there is no universally 
best chart--the proper one will depend on the process, the personnel, 
and the type of instrumentation and the importance of the problem. 
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Control Charts Using Individuals in Sub-Groups. io facilitate compari- 
son with the usual chart 1 have taken an example from Grant (l) page 18. 
I shali not reproduce the data, for in the proposed procedure, no 
record form is used, Hather the readings in each sub-group are plotted 
directly, the five points being plotted along a vertical line segment as 
shown in figure 1. For this one should use rather coarse graph paper 
with large squares, since for most characteristics 10 to 30 units of 
measurement cover the range of variation encountered. We now proceed 
to change this plot of individuals to a control chart. 


cpa + oo 
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Figure 1 


The first step is to indicate the median of each sample, which we 
have done here with a box. Next, using a reterence card, the indivi- 
dual ranges are measured. To do this the bottom edge of the card is 
placed at the lowest reading of the subgroup, and a mark is recorded 
opposite the largest reading in the same sub-group. Having marked the 
20 ranges on the card (shown on the right of figure 1), we counti down 
to the middle mark. ‘this gives us the median range. While it is 
sometimes necessary to interpolate, it is usually better to be conser- 
vative, so we take the median range as 5. We next look up the factors 
for the confrol limits on the, range. These are given in the appendix 
as D3 and D),. In this gase Dy, equals 2.179. This gives a range con- 
trol limit of 10.5. ‘the index card shows that three ranges are outside 
this limit. We make a range control rod as shown on the left, with 
the median range and the maximum range indicated. We then locate these 
three subgroups that are out of control and circle them as shown on 
the chart. 


Next we draw in a center line on the chart. Here we may use 

either the median of the sub-group medians or the median of the indivi- 
dual observations. ‘the number of subgroups is the deciding criteria. 
With only 20 sub-groups it is better to take the median of the indivi- 

al observations which in this example is 33.5. We then look up the 

2 factor for medians which for n = 5 is .712. ‘this tactor multiplied 
by the median range gives a product of approximately 3.5. Control 
limits are drawn at this distance on both sides of the center line, 
Sub-groups 12 and 18 have medians that are outside these control limits. 
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In a process that is so badly out of control there is not much use 
in adding limits for individuals, This has been done here just to 
illustrate the procedure. Dividing the median range by the “do factor 
for n=> gives an estimate ot sigma tor individuals of 2.2. Multiplying 
this result by 3 gives a product of 0.5. Control limits for individuais 
can be drawn at this distance on either side of the central line. 

These control limits are 27 and hu. These are our estimates otf process 
capability provided a state of control can be maintained. 





At this point it might be pointed out that the usual analysis as 
given in Grant arrives at essentiaily the same result. However, to 
do this requires two or three recomputations as out of control results 
are discarded. Such procedures tend to make the novice teel that if 
he keeps at it long enough he will discard ail the data, 


We may wish to have an overall measure of what the process 
actually produced. We could of course make a frequency distribution 
of individuals and find the mean and standard deviation of this 
distribution. However, such accuracy is seldom needed. we already 
have 33.5 as a measure of center. What is needed is a measure of 
variability which includes the five samples that showea cut of control 
conditions. To obtain this simply we count in 7 observations from 
each end. This gives us the 7 and the ¥3 percentile points. In this 
example these are 28 and 38.5 respectively. Taking one third of this 
ditference (the correct multiplying factor is .3388) gives an overall 
standard deviation of 3.5 as a measure of process achievement. lf we 
compare this overall standard deviation to the estimate of the within 
group variavility (3.5/ 2.2) we fina that the variation in the process 
is approximately 0U% more than we would have if the process could be 
controlled. 





Finally, it seems to me that control charts should be summarized 
before we become embarrassed by ancient history. Let us assume that 
the above data represents some unit - production, time or merely the 
end of a graph sheet. It can be summarized as follows: 


Average 3ued 
Process Capability 2 6.5 
Process Achievement flu.» 
X out of control 2 
R out of control 3 


{valuation and Criticism of the Median oub-Group Chart. The chart has 
several definite advantages that might be listed: 





1. The data sheet and the arithmetic involved in finding X and E 

have been @liminated, 

Since bot] the X median and the R median are not greatly 

atrected by out ot control data, recomputation of the center 

lines and control Limits is practicaily never necessary. 

3. The center line on the chart is the proper one for runs above 
and below the median, 

4. Since individual results are plotted, it is possible to show 
both control limits and specitication limits. 

5. Comparisons between process capability and process achievement 
are easy to make. The chart can easily be summarized. 


2 
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Any chart has disadvantages - at least in the wrong application, If 
the individual observations on the data sheet have auxiliary information 
tied to them - such as the piece part number, machine position or 
material - we may tind that a data sheet is necessary, if the data 
sheet is necessary, I see no reason tor using this type of chart. It 

is obvious that the chart emphasizes the average and soft pedais the 
range, and that it is most useful in those applications where the 
control of the average is ot major importance. 


‘the Mid-Range Sub-group Control Chart. A rather obvious adaptation 

is the chart where Mid-Nanges are used as the measure of center. In 
this case we plot only the extreme readings and the average of the 
extremes. ‘the analysis of ranges is the same as we have outlined 
above. ‘the control limits ior Mid-Kanges require a slightly different 
factor which I have indicated by "A>. This chart has many of the advan- 
tages listed above. It does not have advantage number 5. I find that 
the decision between the median chart and the mid-range chart depends 
to a great extent on the process. There are some generalizations that 
help in this decision: 





1. From the viewpoint of statistical efriciency for sampling 
from a normal distribution, the midrange is as good or better 
than the median for sub-groups of n = 3,4,5. For n greater 
than 6 the median is more efticient. 


2. The statistical efriciency of mean, mdian and mid-range all 
depend on the type of distribution. The mean is most effi- 
cient for the normal distribution. For distributions with 
limited range the efficiency of the mid-range increases (5). 
For distributions with extremely long tails such as the Tukey 
distribution (6) the median is the more efticient. 


However, sampling from a fixed population is only part of the 
story. If we have lack of control we have an indication that there 
is no one population involved. In such a case the median range is 
superior to the average range,s8ince the 'contaminated' ranges have 
less effect on the median, If the percent of contaminated ranges is 
very high, say 20 or 3U percent, this procedure will be poor, but so 
will any procedure. 


This question of contamination is important in choosing between 
the two charts. Suppose one sub-group had four typical readings and 
one unusually high reading. This high one we shall suppose is a 
‘wild shot.' Do we want our measure of center to show as outside our 
control limits. I think the answer is that in production we would not, 
while in research and development we would. I believe that this may 
be the reason why Ferrell prefers the mid-range chart and 1 prefer the 
median chart - we are expressing our differences as to what we want 
the chart to show in different types of applications. On a sample like 
the one described I am atraid that the operator will reset the process-- 
Ferrell is afraid that the engineer will ignore possible troubles. 


We have admitted that if the process is in control either the mid- 
range or the median is likely to be less efficient than the average. 
However, I question whether this is a real deficiency and might even 
claim that it is an advantage, I find that the usual trouble with 
control limits is that economically they are too 'tight' and that as 
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a result there is difficulty in getting people to take any action when 
the charts show points out of control, It may be that wider control 
limits would be a real benefit. As an example consider a process with 
specitication limits of 246. 


Control limits for the average would be 213k 6 
Control limits for the median would be 21.61 6 
Moditied limits for the average would be = 2.34 6 


It appears that the less sensitive limits for the median would not be 
a disadvantage unless the maximum control is needed. 


/@ 
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Figure 2 


Control Charts for Sub-Groups, Two way classitication. In many appli- 
cations each observation may be classified in two ways. One is usually 
time, the second is frequently position. As an example figure 2 shows 
the results of Preece Dip Test on the amount of galvanigation on wire. 
In this case a gaivanizing bath carried thirty wires. For sampling 
purposes the bath was considered as being in five sections, and one 
wire was selected from each section. ‘he wire numbers of the highest 
and lowest readings were recorded on the chart, but have not been 
shown here, Samples 2 and are out of control on the median. The 
cause turned out to be the method of charging plus a defective elec- 
tronic control. Sample number 2 is also out of control on the range. 
Both the high and the low readings were investigated. It was discov- 
ered that the same wire was sampled on samples 2, 12, 13, and 16, the 
readings being 4.5, 5.0, 4.5 and 6, It appears that this wire is 


giving unusually low readings. the cause appeared to be a defective 
die. 





This example is given to indicate the type of application. I 
have found this most useful in analysis of multiple spindle machines, 
filling operations with multiple heads, batch sampling with strati- 
fication and so forth. 
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Figure 3 


Control Charts for Skewed Distributions. The distribution of the 
median is approximately normal for moderately skewed distributions. 
For very skewed the data is sometimes transformed by taking logarithms. 
This same result can be accomplished graphically by plotting the indi- 
vidual readings on semi-logarithmic paper. Figure 3 shows an example 
taken from Cowden(9). The individual items are plotted, the median of 
each sub-group is circled, and the range of each group is measured 





using a reference card as before. The Median range is then found by 
counting. In this example the median range is just less than the 
length of one cycle on the log scale. This distance is then multiplied 
by the factors Dj,= 2.179 and Ag= .712. The ranges in this example are 
all in control. The distance to the control limits for medians is 
measured off on each side of the center line. In the example the cen- 
ter line is 11.5 and the control limits are 2.3 and 58, All the medi- 
ans are inside these control limits, Three sigma limits for individuals 
can be found by taking the median range (distance), dividing by 

dg= 2.257 and multiplying by 3. This distance when laid off from the 
center line gives three sigma limits for individuals of 0.5 and 250, 


The usual control chart for average and range shows several points 
out of control as can be seen in Cowden. Cowden handles this problem 
by finding the log of each observation, and making a control chart for 
this transformed data, This involves looking up the logs of 150 numbers 
plus the arithmetic to make a control chart. The time by the proposed 
method is that required to plot the readings, plus three minutes to 
find the limits, The results are practically identical. 
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Control Charts for individual Observations 1n some applications, 
especially in the process industries, there is no rational way for 
sub-grouping the data. In other applications it may be that ea sam 
ple of more than one is not justified economically. Figure li shows 
individual test results on a natural product used in the chemical 
industry. The ranges are found by measuring from one reading to the 
next. The median range is 3.5. Since individuals are so insensitive 
to shifts in average we used two-sigma control limits. These are 
obtained by dividing the median range by Gp of .¥5l and mitiplying 
by 2. This gives control limits of * 7.5. The median ot the X's is 
42.5 and we can easily see that there are many points out of control. 
Examination of the chart shows evidence of either a cycle or a shitt 
in average. in this case a cycle would have no operational meaning, 
but a shift in average could be readily explained. 





‘to analyze the data for shift, a center line and control limits 
were drawn on clear plastic. ‘his plastic overlay is then shifted up 
and down vertically. ‘The number of runs above and below the median 
must be watched carefully. After several trials we find that there 
appear to be three difterent levels or batches, as is indicated on the 
chart. We cannot guarantee that the batches terminate where the chart 
indicates they do. However it we can adjust the process to compensate 
for ditrerence in batches, we have the approximate place to make the 
change. Une test result on sample number 2): is a "wild shot" and 
should be investigated or the material set aside. 


The use of any gadget, such as the plastic overlay is ditticult 
to describe and easy to demonstrate. There is of course the criticism 
that one does not have exact probability levels for such an eye test. 
The same is true of any similar procedure, such as fitting a line to 
cumulative data on probability paper. Moreover 1 know of no exact 
statistical method for segregating batches such as we have attempted 
here. 1 can state that the actual industrial application was success- 
ful, but I cannot deny that 1 may have been fortunate. However the 
Same procedure is very useful when we know that there are definite 
batches. 
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Multiple Control Uharts on Individual Observations. Control charts 
on individual observations are especially useful when several ditferent 
characteristics are recorded tor each sample, dsince the characteristics 
can be plotted one below the other, it is easy to keep records on four 
or five characteristics on one sheet ot graph paper. I know one indus- 
try where five process characteristics and two final product character- 
istics are analyzed in this fashion, It is possible to observe not 
only the eftect of changes in individual characteristics on final qual- 
ity, but also the interaction in changes of two or more characteristics, 
Space does not permit discussion of such charts, but it would appear 
that the application is obvious, 










Control without Charts The question as to when a chart has outlived 
its usetullness is a ditficult one to discuss. However the usual 
statement that removal of charts is dangerous just does not square 
with the facts ot industry. By this time industry has discarded more 
charts than they have retained. I suggest that we should take a good 
honest look at the problem, and T believe that there are many cases 
where a chart is not needed. My question is - How much control? The 
control chart for medians can be easily adapted to a procedure with- 
out charts. To do so requires that the process capability has been 
determined and is less than the specifications. To be specific let 
us refer to figure 1. Let us assume that the process is near a state 
of control, that the target of 33.5 is satistactory, and that control 
is to be based on a standard deviation of 2.5. ‘I'he operator is given 
the following instructions: 








1. Measure each or a sample of five items a 

2. ‘The median is to be between 33.5 2.0 (X tA 6) 

3. The range is to be less than 13 

hk. If the results are out of control in one out of 20 checks 
the process is satisfactory. It out of control more than 
one time in twenty, a chart will be temporarily reinsti- 
tuted. 


This procedure can be easily adapted to most measuring instruments by 
using a slightly frosted plastic overlay, Un this the operator marks 
each reading as he takes it. The control limits for the median are 
pre-marked on the overlay. It it is necessary a parallel scale giving 
the limits for the range could be attached. I have never found this 
particularly useful. 


wo Industrial experience has indicated that there are many cases 
ere a simpler procedure than the usual control chart would be useful. 
Some ot the advantages of control charts using individual observations, 
with control based on the median, have been indicated. Some applica- 
tions to ditterent types of control problems have been indicated. 

These charts have been used in a variety ot industries and the economic 
and psychological advantages appear to outweigh any statistical disad- 
vantages in many applications. 
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Mathematical Appendix 


Untortunately there is no standard set of symbols tor medians and 
midranges. ‘the symbols used here have one advantage - the formlae 
for median and midrange charts are similar to the standard one for 
X and R chartse, The symbols are defined as follow: 


“2, = median of parent universe 
x = median of sub-group 
"~ = midrange of sub-group 
ry = median of all individuals in sample 
x 
XL = median of sub-group medians 
yw ¥ = median of sub-group midranges 
h = median of sub-group r ‘ges 
G@ 6° = &' standards Given 
ae ra T Je 
g2k'=36, Rk "36 A= aA//ky 
= - oF «a ; 
SF ae 4a ae, Fearey . 
4 
Estimates : X' = Lor & or @ 
6 = B/G 
24 
3 6 1imits for q 1: i, ® or ce: rs 
¢ tet 
~~ 
n 1, and ., Kn 


2 2 
Note: By = 6 / - this is tabulated in reference 8, the tables 
that follow hav computed trom references 1, 6, 7 and 8, 
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TESTS OF SIGNIFICANCE I = NORMAL MODELS 


Gayle W. McElrath 
Head, Industrial Engineering Division 
University of Minnesota 


Introduction: 


Many of the concepts that are fundamental to tests of significance 
are already the property of the quality control engineer, and their 
application involves at most an extension of the principles of statis- 
tical quality control. The main effort of this paper is directed toward 
introducing those statistical concepts and methods which give a quanti- 
tative measure to situations for which the quality control engineer 
already has a strong intuitive feeling. We can only hope to illustrate 
a few of the tests of significance concerming the normal model in order 
to give an appreciation of their usefulness. 


The average quality control engineer is not making the most of 
Significance tests. However, there are many opportunities in each area 
of quality control for the applications of such tests. In many in- 
stances the engineer cannot answer questions concerning process level 
or variation because he is not acquainted with—or even aware of—the 
statistical tools which are available. 


NEW WORDS FOR OLD IDEAS 


One of the reasons for the lag in time between the theory and 
methods developed by statisticians and their application by the quality 
control engineer is simply a matter of language difference. The precise 
terminology of the mathematical statistician often seems difficult and 
even "unreasonable" because of its unfamiliarity. However, the quality 
control engineers who make a sincere effort to understand the language 
of statistics are early rewarded by finding that the basic and funda- 
mental ideas of significance tests are natural and extremely useful. 


The engineer considers "a variable which is always subject to some 
stable system of chance causes." (1) The outcomes of system of chance 
causes can be of many types. However, analysis is easier when the out- 
comes are assigned numbers; we attempt to identify a number with each 
outcome of the chance cause system or chance experiment. This assigned 
value is called a RANDOM VARIABLE. We make the following definition. 





Definition: A random variable is an assignment of numbers to 
outcomes of a chance experiment. 


A random variable is called "random" because the experiment is an 
experiment of chance. It is called "variable" because it is a numerical 
quantity which varies from one trial to another trial of the experiment. 
The frequency of occurrence of various outcomes can often be represented 
by a MATHEMATICAL MODEL of the numbers to which probabilities of the 
outcomes have been defined. Such a model might also be called a 
FREQUENCY FUNCTION which gives the pattern of the potential values 

that the random variable can take on. 


This paper assumes that the reader is acquainted with the concepts 


of mean, variance, and standard deviation. In the following discussion 
the terms mean, average, and process level will be used synonymously. 
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However, we must emphasize that it is of the greatest importance to 
distinguish between the mean and variance of a population which are 
PARAMETERS and the similar quantities calculated from a random sample 
which are called STATISTICS. A parameter is a population measure which 
is usually an unknown constant, and a statistic is a sample measure 
which is a random variable. 





When (1) sampling with replacement and mixing, (2) sampling without 
replacement from an infinite population, and (3) sampling in cases where 
the population had to be imagined, the observations of the sample are 
independent, and they share a common pattern of potential values. 
Furthermore, that pattern of potential values for each observation is 
exactly that of the underlying random variable or "population." We 
introduce, then, the following definition: 


Definition: A random sample is a sample whose n observations may 
be represented by the mathematical model of n inde- 
pendent random variables, each a replica of the popula- 
tion from which the sample is taken. 


Even though we shall consider principally random samples in the 
sense of the above definition, we should mention that it is possible 
to modify the formulas to cover the case of sampling without replace- 
ment from a finite population. Most of the theory of statistical 
inference from samples is based on the assumption that the sample is 
a random sample from some population. 


The mathematical model that the statistician constructs for 
practical use in interpreting data is often discovered by noting what 
happens when sample after sample is drawn from the same population. 

All of us have made the observation, for example, that the mean varies 
from one sample to another. Since we assume that the method of selection 
is kept uniform through the sampling process, the discrepancies that we 
observe can logically be assigned only to the sampling process—the 
population remains constant. It seems reasonable, therefore, that we 
call these fluctuations from sample to sample, SAMPLING ERRORS. This 
fluctuation of the statistic from sample to sample can be shown to be 
random; hence, the statistic itself is entitled to its own frequency 
function which is often called the SAMPLING DISTRIBUTION. Most every 
apolication of statistical methods involves one or more sampling 
distributions. 


Before we proceed further, let us mention that the quality control 
engineer utilizes most of the above theory when he constructs an X<X 
control chart. He thinks in terms of a measurable quantity (random 
variable) which has a stable pattern of variation (frequency function). 
When he takes samples (random samples), he calculates the sample mean 
(statistic). If he has assumed that the pattern of variation of the 
measure is a normal frequency function, then it can be shown by math- 
ematics that the distribution of the sample mean (sampling distribution) 
is also normal. If the engineer does not assume the pattern of variation 
to be normal, he takes advantage of one of the most practical and one of 
the most useful results from mathematical statistics known as the 
CENTRAL LIMIT THEOREM.(2) The theorem states that with increasing 
sample size the distribution of means for samples from any population 
which has a mean and variance tends toward the normal distribution as 
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a limiting form. This is one of the reasons that we prefer to use the 
chart rather than an individual's chart, especially when we are not 
too well acquainted with the parent population of the process. 


CONTROL CHARTS AND SIGNIFICANCE TESTS 


Now consider a few more control chart concepts which are already 
the property of the quality control engineer and which can be directly 
translated into a discussion of significance tests. When using the 
control chart technique, the quality control engineer has a sequence 
of samples generated by a manufacturing or measuring process. If every 
set of sample values behave like a random sample from a fixed population, 
the system is said to be in STATISTICAL CONTROL. The assumption on 
which most statistical methods are based, including significance tests, 
is that a state of statistical control exists. Lack of control implies 
a shift in the population. What is the logic behind such a statemamt? 


Consider a familiar application of an X control chart. It is to 
be used "to provide a basis for current decisions during production as 
to when to hunt for causes of variation and take action intended to 
correct them." Assume that the process has been in statistical 
control for a long enough time to determine the process average and 
the 3-sigma control limits. These control limits serve as the criterion 
to decide whether the process is still operating at tne same long run 
process level. By using the control limits which were determined from 
past data to make our present decision, we are actually making the 
assumption that the long run process level has not changed and we are 
testing this assumption. That is, we are making an assumption con- 
cerning the nature of the frequency function of the observable random 
variable. This assumption is called a STATISTICAL HYPOTHESIS. 


An assumption that completely specifies the population is called 
a SIMPLE HYPOTHESIS. If the assumption does not completely specify 
the population, the hypothesis is called a COMPOSITE HYPOTHESIS, 


The rule which decides between the two courses of action—namely, 
that the process is "in control" or "out of control"—and is based on 
some statistic computed from a sample of a given size is called a 
STATISTICAL TEST. The point plotted is the SAMPLE POINT. To conclude 
that the process is “out of control" is to REJECT the hypothesis; and 
to conclude that the process is "in control" is to ACCEPT the hypothesis. 
We might very well speak of the region outside of the control limits as 
the CRITICAL REGION or REJECTION REGION, and the region within the 
control limits as the ACCEPTANCE REGION, It is clear that the outcome 
of the test is reduced to either accepting or rejecting the HYPOTHESIS 
UNDER TEST. We are considering a TWO DECISION problem. 


In the above discussion of the X control chart, if the control 
limits are established from a statistical point of view, the decisions 
of acceptance or rejection are made on a probability basis. If a sample 
point falls in the critical region, there are two possibilities: either 
a very rare event has occurred, and the hypothesis is true, or there 
has been a SIGNIFICANT change in the process level and the hypothesis 
is false. We bet on the latter. Just how rare the event must be in 
order that the hypothesis be rejected must be defined. If the 3-sigma 
limits are used on the X control chart and if the sampling distributim 
of the means is strictly a normal frequency distribution, a process 
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which is unchanged will show sample points outside the 3-sigma limits 
only 26 times in 10,000. Since we are taking sample values to make an 
inference about the long run process level, we accept the fact that we 
are going to reject a true hypothesis with a certain probability. That 
is, we are willing to run the risk of making a wrong decision, for 
example, 26 times in 10,000 and say that a sample point beyond the 
3-sigma limits signifies a change in the process level. This risk that 
we take in rejecting a true hypothesis (that is, of saying the process 
level has changed when actually it has not) is called the LEVEL OF 
SIGNIFICANCE and is denoted by the Greek letter a (alpha). A true 
hypothesis will be rejected a percent of the time. Statisticians 
call this error a TYPE I error. 


The quality control engineer is also running the risk of making 
another type of error—the error of stating that the process level has 
not changed when in reality it has. The error of accepting the hypoth- 
esis when it is false is labeled the TYPE II error. The risk (proba- 
bility) of a TYPE II error is designated by the Greek letter B (beta). 
This is to indicate that 8 percent of the time the test will allow an 
actual change in process level (of preassigned size) to slip by un- 
noticed. The technique of imposing the limit B at the desired point 
will be illustrated numerically later. 


We have stated that the decision to accept or reject a hypothesis 
is based upon sample information and is characterized by risks that we 
are willing to take that our decision be wrong. What we mean by a 
wrong decision is conveniently illustrated in Figure 1. This figure 
occurs in Wald's fundamental paper on the mathematical theory of 
decision problems, and illustrates very clearly the four possible 
situations resulting from a decision. ) 





























TRUE HYPOTHESIS TRUE HYPOTHESIS FALSE 
SITUATION (Level Unchanged) (Level Changed) 
ACTION TAKEN DESCRIPTION OF DECISION 
| | 
| 
ACCEPT the | 0.K TYPE II | 
Hypothesis | nile EXROR 
| 
REJECT the | TYPE I | - 
| Hypothesis | ERROR _— 


| L ! 
Figure 1. Four Possible Situations Resulting 


from an Application of an Industrial 
Significance Test. 











TESTING STATISTICAL HYPOTHLSES 


Just as in the control chart procedure, let us consider that the 
reason for gathering and analyzing data is the need for analytical 
assistance in deciding between two courses of action, say A and B. 

The course of action that we should take would be clear if we only knew 
the precise nature of the population. However, because we lack this 
complete knowledge, we will make a decision based on some statistic of 
a random sample taken from the population. 


Consider the following scheme to place the,simple hypotheses into 
two separate piles and label them as follows: 


H.: all simple hypotheses whose truth would make 
A the better action. 


H,* all simple hypotheses whose truth would make 
B the better action. 


The following example serves to illustrate the above, 


Example 1. A coin which we have not examined is to 

be tossed. We assume that the coin is a "standard 
honest" coin. However, it might have two heads or 

two tails. We adopt the following test: toss the 

coin four times, if all heads or all tails occur, 

we reject the coin as standard. Let p denote the 
probability of tossing heads in a single toss. 

The hypothesis being tested, Hy» and the alternative, 


Hs are? 


Hot Pp 6 and Hs p*=0O or p#=l 


Here, Hy is a simple hypothesis and Hy is a 


composite hypothesis. The probability of rejecting 
Hy when Hy is true is a= 1/8 The probability 
of failing to detect a non-standard coinis B = 0 
for either of the simple hypotheses which make up the 


composite Hi) 


Tests of Population Mean = Population Normal and Standard Deviation 
Known = A Two-sided Test. 


Consider the case in which the sample size is prescribed by 
considerations beyond our control. In this situation, we are able 
to determine only the significance level a, and then to accept what- 
ever happens to us, good or bad, in the way of power with which to 
detect the fact that the nypothesis is false - although this may be 
our primary motive. The following steps will prove helpful for testing 
a statistical hypothesis. 


1. Choose the random variable and decide on the a priori 
conditions which are not under test but are considered known. 


2. State the hypothesis Ho together with the alternative 
hypothesis H}. 
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Specify the test statistic. 


Se Ww 
° 


Denote the sample size and the significance level a. 


« Determine the critical region. 


oO VI 


- Take a random sample, calculate the value of the test statistic, 
and make a decision to either accept Hy or to reject Ho 


Consider the following example of testing a statistical hypothesis. 


Example 2. We are interested in a measurement on a 
manufactured part, and would specify that the long 
run process average of the dimension should be, say, 
‘=a 25 in order that the bulk of the parts 


function satisfactorily when in an assembly. We take 

a sample of 16 and find the sample mean to be X& = 21, 
Assume from past data that the distribution of the 
dimension may be considered normally distributed (for 
practical purposes) and that the standard deviation 

of the process seems to remain constant 

and is o# 6. On the basis of the sample information, 
have we reason to believe that the process has shifted? 


The following are the above recommended steps for testing a 
statistical hypothesis as they apply to example 2. 


1. The random variable is the measurement on the manufactured 
part. The a priori conditions not under test are that the 
random variable is distributed nomally with constant 
standard deviation o = 6, 


2. The hypotheses to be tested are 


Hs p= 2 H_$ 25. 
3. The test statistic, for this particular test, is 
get Ho 
My 


The random variable 2 is distributed normally with mean 0 
and standard deviation l. 

4. Let the significance level a = 5% and the sample size 
n= 16. 

5. We wish the critical region to reflect that a dimension 


either too large or too smail will not function satisfactorily. 
The critical region expressed in terms of z becomes 





z< -1.96 and z> 1.96. 
(In terms of the sample average _ the critical region is 
X € 22.06 and X > 27.9.) 
6. For our particular sample we calculate 2z to be 
21 = 25 


zs “ter = «2.67 
16 
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Since this value of z lies in the critical region, the 


hypothesis Hy is rejected. 


We have discussed the significance test with regard to the proba- 
bility of rejecting the hypothesis when it is true that the mean is 
actually 25. What is the probability of rejecting the hypothesis when 
the process has actually shifted to 22? 26? It begins to look as 
though we had considered only one point, oO = 25, whereas all of the 


values around 25 might be of great interest. 


OPERATING CHARACTERISTIC CURVES AND POWER CURVES 


We shall introduce the properties of the operating characteristic 
curves for tests of significance by discussing the OC curves for accept- 
ance sampling plans. 


When considering a sampling plan, we are interested not only in the 
AQL and its associated a; in addition, we are interested in knowing 
what the sampling plan will do when lots of quality other than the AQL 
are submitted for acceptance. The chance that a lot has of being 
accepted depends upon its lot quality. What ability has the plan to 
detect these differences in lot quality? The OPERATING CHARACTERISTIC 
curve gives this information, that is, the probability of accepting a 
given lot of any percent defective. 


Many of the ideas of accepting sampling, including the operating 
characteristic curves, carry over immediately to significance tests. 
How will our significance test behave when the true process level wu 
is not hl It is essential that we have an OC curve which will apply 


to the significance test. These curves exist, (4) The OPERATING 
CHARACTERISTIC CURVE FOR A STATISTICAL TEST gives the probabilities of 
accepting the hypothesis corresponding to any alternative process level. 
Some statisticians like to talk in terms of a POWER CURVE for the test. 
The power curve gives the probabilities of rejecting the hypothesis 
corresponding to any alternative process level. Since the probability 
of rejection is one minus the probability of accepting, we see that the 
power curve gives exactly the same information as the OC curve. Either 
curve can be used to judge the adequacy of the test. However, in this 
paper, we will construct the OC curves for the tests. 


Operating Characteristic Curve for the Two-sided Test. 


To derive the OC curve for the above test, we compute the proba- 
bility that 2 falls between the following two quantities, 


= 


con « P [22206 = # 27.94 =p 
oC function P aa ‘ Zz < “3* 


where u and o are the parameters of the normal distribution and n 
is the sample size. The quantities o=6 and n#=16 are given as 
known. 
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For the various values of uu the OC can be developed as follows. 








22.06 =u 


(4) 


OC Curve - 
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gives the OC curve associated with the above tabular values. 


OC Curve for the Two-sided Test of the 
sp = 25 with a= .05 


given 


—" 





27 


Long Run Process Average 


31 


Tests of Population Mean = Population Normal and Standard Deviation 
Known - A One-sided Test. 


In the previous section we considered a two-sided test because of 
the nature of the critical region which was dictated by the hypotheses 
and H.. 











Example 3. Consider a change in the nature of the 
hypotheses in example 2, and that 


Hi u < 25 and Hind 25. 


The critical region defining the test becomes one-sided and of the 
form 2 % ky where the constant k is determined by the significance 
level a. If a#= 0.05, the critical region is given by 2 > 1.645. 
The other steps of the analysis remain the same as before. To derive 
the OC curve for this test we compute the following probability 


OC function = P | z< a =p 
= 


n 





where » and o are the parameters of a normal distribution and n is 
the sample size. In our problem it is given a priori that o=6 and 
n= 16. The OC curve for this test is given in Figure 3. 


Figure 3. Operating Characteristic for a 
One-sided Test. 


Ht wx 253 A? » > 253; a = 0.053 and n= 16. 
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Long tun Process Average iu. 


Determination of the Sample Size = Given Two Values of the Population 
Mean with Corresponding a and B. 


In the above problems we arbitrarily chose the size of the sample 
and the significance level. However, step number 4 of the "steps for 
testing a statistical hypothesis," could read "Choose two values of the 
parameter and the corresponding values of a and 8B, and find the 
sample size n." The remainder of the steps do not change. 
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Example 4. Consider again example 2 which involves the 
two-sided test with the hypotheses 


Ho? » = 25. and H,? yw ¥ 25 
where the standard deviation is known and o = 6. 
However, in this situation suppose that we have object- 
ively evaluated the probabilities of making wrong 
decisions and have set a level of significance a= 0.05 
for uw = 25, In addition, we state that a process average 
which has shifted to 21 or 29 should only be accepted 10% 
of the time. 


Let n_ be the size of the sample to be taken and let xy 
and xX 
of the critical region to be specified in terms of the 
sample average. We will assume that the sample averages 
are normally distributed. The following equations 
describe the above conditions which define the test: 


be the uoper and lower iimits of the two sides 


= & 1.282 
we 


yo = + 1.282 


1.960 


of! 


Here we have three independent equations in three unknowns. If we 
eliminate Xy from the first and third equations, we obtain 


— wa ks = 23.65 | 





n = 23.65 


We will use n = 24. Now if we substitute n = 2) in both the first 
and second equations, we find that Xy = 27.4 and that x = 22.6 


which give the boundaries of the uoper and lower segments of the 
critical region that defines the test. 
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The operating characteristic curve for the above test can be found 
in a similar manner to that previously used for the two-sided test given 
for example 2. 


Tests of Proportions - Large Samples 


A similar procedure to that described above can be used to test 
hypotheses regarding population proportions. That is, we might be 
interested in testing whether an industrial process is operating in a 
random manner to produce a specified fraction defective of units. It is 
very common to determine the critical region and sample size correspond- 
ing to a given set of hypotheses by using the normal approximation to 
the binomial distribution. 


SUMMARY 


In many instances the quality control engineer is faced with a 
situation which demands that he decide on either of two causes of 
action, A or B. There are statistical methods which deal with the 
gathering and analyzing of data for the purpose of analytical assistance 
in deciding on the course of action*to be taken. The suggested steps 
offered to test significance can serve to help the engineer state the 
problem together with deciding upon the probabilities of making wrong 
decisions. 





We should emphasize that though we draw a conclusion as the result 
of the statistical test, the test doesn't prove anything. If it turns 
out to accept Ho» this does not mean that Hy is necessarily true. 


We do not have to believe either that it is or that it is not true; 
we have a decision as to which of two actions to take. ‘The test is 
simply a rule for decision, and one which turns out all right in many 
cases. 
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SIGNIFICANCE TESTS II - SMALL SAMPLES 


Max Astrachan 
Air Force Institute of Technology 


In the first paper in this series we have discussed the basic phi- 
losophy and concepts behind tests of significance, and how such tests 
are related to control charts. In addition, we have seen how to con- 
struct one-sided and two-sided tests on the mean in the case in which 
the population standerd deviation @ is known. Our procedure was based 
on the fact that the means of samples of any size from a normal popu- 
lation are normally distributed with mean equal to the population mean, 
and variance (square of standerd deviation) equal to the population 
variance divided by N, the sample size. We used the test statistic 


2: — i ’ (1) 


which is normally distributed with mean O and veriance l, X is the 
observed sample meen and Fo the hypothesized population mean. 


We cannot use z for a test statistic when @ is unknown, although 


it seems logical to replace o in (1) by s, the sample standard devia- 
tion*, the resulting expression 


¢<s X= fo = (2) 


is no longer normally distributed. Instead it follows what is known as 
"Student's" t distribution. 








*sample standard deviation is defined as the square root of the sum of 
the squares of the deviations of the measurements from their mean 
divided by N-l. In symbols, 





= (xX, -%) 


- ™ 


N-! 





The quantity N-l is celled "degrees of freedom," abbrevieted df, and we 
commonly say that s“ has df = N-1. For ease of calculation, especially 
when a desk calculator is available, it is often convenient to replace 
the numerator by the following expression, to which it is algebraically 
equivalent: 


sx (2x) 
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For a given sample size and for a population of known mean, z con- 
tains but one quantity, X, which changes from sample to sample. However, 
t has two such quantities, X and s. Hence it seems reasonable that t 
fluctuetes more from sample to sample than does z. 


There are, in fact, many t-distributions, one for each value of df 
(degrees of freedom) associated with the sample variance. For (2) we 
always have df = N-1. Each distribution curve is bell-shaped and 
looks like the normal distribution curve but has longer "tails." 

Figure 1 shows severe] t-curves and the normal curve. As N increascs, 
the t-distribution approaches the normal distribution with mean O and 
variance l. 


0.44 
: 


0.34 


0.24 











Fig. 1. Distribution curves of t for df = 1, 5, 10 andeo,. 
(The curve for df = oO is the standard normal curve) 


In order to use the t-statistic to test hypotheses, we shall need 
to know percentile points corresponding to the levels of significance we 
want to set. Tables of these values are given in most standard books on 
statistics. (See for example /17, p 384.) Since there is a different 
curve for each sample size, there will be many sets of such points. 
Table 1 gives a portion of a table usually found in the literature. The 
particular t-distribution is specified in the column headed df (= N - 1), 


Table 1 - Some Percentile Points of the t-Distribution 








Percentile 

af 095 0975 99 
5 2.015 2.571 3.365 
10 1.812 2.228 2.764 
15 1.753 2.131 2.602 
20 1.725 2.086 2.528 
25 1.708 2.060 2.485 
30 1.697 2.042 2.457 
60 1.671 2.000 2.390 
120 1.658 1.980 2.358 
o 1.645 1.960 2.326 
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and the percentiles are indicated at the top of the table. Because of 
the symaetry of the distribution, the .05, .025, and .0l1 percentile 
points are the negatives of the .95, .975,and .99 values respectively. 
Note that the values in the last row, for which df = co , are those for 
the normal distribution. Since they do not differ much from the per- 
centile points for large df we often use z in (1) as a test statistic 
when o is unknown, but the sample size is large. In such cases, we 
replace o by its estimate, the sample standard deviation s. 


With this brief description of the t-distribution, we are now in a 
position to perform tests of hypotheses on the mean in the case in which 
the population standard deviation o is unknown. The procedures of the 
first paper in this series are applicable, the only difference will be 
in the test statistic and the critical values. As we shall see later, 
we shall be able to perform other tests of tnypotheses also, by use of 
the t-distribution. 


We shall illustrate the method by means of the following example: 
- The weights in pounds of a sample of 6 castings are as 
follows: 49, 55, 53. 57, 48, 53. Is the process consistent with the 
specified mean of 50 lbs? 


Applying the procedure of the first paper we then have the follow- 
ing steps: 


1. The random variable is the weight of a casting. We assume 
these weights are normally distributed. 


2. Hypothesis to be tested: The long run process average is 
50 lbs., i.e., H: las = 50. The alternative hypothesis is Hj: # 50. 


3- The test statistic is 





which we assume has a t-distribution with 5 degrees of freedom 
(af = N - 1 in this case). 


4. Choose a significance level, X= 0.05. 


5. From Table 1, we see that the critical region for this signifi- 
cance level is t $ - 2.571 and t 2 2.571. 


_ 6, From the given data,£=X; = 315 andZX,* = 16,597, whence 
X = 52.5, and from the formule in the first footnote, 





o =\i 16,597 - (315)* 


6: rn - 11.9 = 3.45. 
; Y's 
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Substituting into the expression for t in step 3 above, we have 


t = 5225-50 - 2.5 
345 1.41 
6 





= 1.77 


Since this value of t is not in the critical region we accept the 
hypothesis H,, that the process mean is 50 lbs. 


The above is an example of a two-sided test, which was dictated by 
the manner in which the question was asked in the problem. Had the 
question been "Is the process consistent with the specification of mean 
weight not greater than 50 lbs?," our hypotheses would have been 


H,: KE 50, Hy: 4750. 


The critical region then becomes one-sided and is of the form t 2 k, 
where k is determined by the level of significance and the df. In our 
case, where & = 0.05 and df = 5, the critical region would be t = 2.015. 
The remainder of the analysis is the same. Whether we are going to 
make a one-sided or a two-sided test must, of course, be decided in 
advance, 


COMPARING TWO POPULATION MEANS 


The problem of comparing two populations with respect to some 
characteristic is one that frequently arises in practice. In these 
situations, one course of action would be appropriate if the two popu- 
lations may be considered the same with respect to the characteristic in 
Question, and another if they are not. 


The following questions suggest the sort of problem one meets in 
this area: Does a population of smokers have greater susceptibility to 
disease of a certain type than a population of non-smokers? Does 
fertilizer "A" work better than fertilizer "B" on the average? Does a 
certain new piece of equipment produce bar stock with greater varia- 
bility in dimensions than the old? Are men better drivers than women? 
Does one method of measuring the concentration of a chemical have more 
inherent variability than another? Does machine "A" produce more 
defective parts than machine "B"? 


Two random variables may differ with respect to level or centering 
as measured, for example, by their means; they may also differ with 
respect to variability or repeatability as measured by their standard 
deviations. There are other differences, of course, but these two are 
usually the ones in which we are interested. In this section we shall 
describe several tests which may be used for comparing two population 
means. As we shall see, the appropriate one to use depends upon the 
information available. Tests on variability will be covered in the next 
paper. 
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Suppose we have two samplesof size N) and No from each of two 
populations with means 44}, A@fo.» and variances 71°, To, respectively. 
To test the hypothesis that the two population means are equal, we shall 
use the test statistic 


(3) 








where % and % are two sample means. If the two populations are normal 
and “4, = Mo, then z is a standard normal variable (with mean O and 
variance 1). 


The following example illustrates the use of this test statistic: 


Example 2. A shearing pin has a hexagonal head and is made on two 
different machines. A sample of 100 is taken from each of the machines 
and the Rockwell hardness determined for the head of each pin. The 
following results were obtained: 


Machine _N_ xX ,™ 
1 100 41.63 1.8 
? 100 40.67 2.0 


Is there a significant difference between the average hardness of the 
heads produced by the two machines? 


Although we do not know the population variances, we shall assume 
that the sample sizes are large enough so that we can replace them by 
the sample variances in (3). 


The test then proceeds as follows: 

1. There are two random variables: X) and Xo, the hardnesses of 
the pins from the two machines. If these are normally distributed, the 
distribution of z in (3) is normal with mean O and variance l. If not, 
we assume the sample sizes are large enough so that z is approximately 


normally distributed, even when we replace the population variances by 
sample variances. 


2. Hypothesis to be tested: KH): fi = Ka: The alternative 
hypothesis is Hy: (4) # Ko. 


3. The test statistic is given by (3), which we assume to be 
normally distributed. 


4. Choose the level of significance A= 0.05. 


5. The critical region is z £ -1.96 and 22 1.96. 
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6. Substituting the given data in (3), using the known sample 
variances in place of the unknown population variances, we have 


41.63 - 40.67 _ .% 
z= = = 3.5/7 
as P (2.0)2 269 30ST 


100 100 





Since this value of z is in the critical region we reject H, and con- 
clude that on the average the hardnesses of the pin heads produced by 
the two machines are different. 


When the samples are small we cannot replace the population vari- 
ances in (3) by sample variances, for then the statistic is no longer 
normally distributed. We have to distinguish between two cases, one in 
which the variances of the two populations are equal, and the other in 
which they are unequal. Tests for the equality of variances will be 
given in the next paper in this series. 


Case 1 - Equal Population Variances. In this case we combine the 
two sample variances to give us an estimate of the common population 


variance, This is given by 


(N - 1)8,° + (N, - 1)s,° 
8,2 = 1 2 E... (4) 
N, +N, - 2 





We refer to s 2 as the pooled or averaged estimate of the common popu- 
lation variance o 2 obtained from the two sample variances s,“ and 85°. 
The test statistic used to determine whether two population means are 
equal or not is then 
‘ X%, - % 

- ’ (5) 





which follows the t-distribution with N, 4 N, - 2 degrees of freedom if 
Pr = Hoe 


Example 3. Following are samples of the breaking strengths of two 
types of cloth: 


Type 1: 61 52 45 46 
Type 2: 48 71 69 68 58 70 64 
Is there a significant difference between the mean breaking strengths? 


From the given data 


X% = 51, Xo = 6h, 83° = 5h, 80% = 69.67 
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whence from (4) 


2. _3 = 5h ¢ 6 x 69.67 
4+7-2 


= a = 64.bh, 





®p 


so that 
8p =V64.4 = 8.03. 


Following the procedure of the other examples, then, we have the 
following steps: 


1. There are two random variables X, and X,, the breaking 


strengths of the two types of cloth. We are assuming the variances of 
the two populations are equal. 


2. We are testing the hypothesis Ho: #1 =KMa against 
Hy: My Eu 2 


3. The test statistic is given by (5) which we assume follows the 
t-distribution with 9 degrees of freedom. 


4. Choose the level of significance % = 0,05, 


5. The critical region is t £ -2,262 and t 2 2.262. (This is 
found from a more complete table than Table 1.) 


6. Substituting the values found above in (5), we have 








¢ 51 - 64 -13 

s s 

| oo) (8.03) (.6268) 
4 7 

= -2.58. 


Since this value of t is in the critical region we reject H, and con- 
clude that the mean breaking strengths of the two types of cloth are not 


the same. 
Case 2. Population Variances Unequal. In this case, assuming the 
populations are normally distributed, the statistic 





(6) 





be 
2 


~<a 
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has approximately a t-distribution if the two population means are 
equal. The number of degrees of freedom is given by 


2 2\2 
($+) 
Ny No 


2 2 
2 2 
. = 
N N 
1 2 
7 


N, +1 N, +1 

















which is usually not an integer. However, the closest value in the 
table will generally be sufficient. 


Exemple 4. The diameters of ten samples each of wire produced on 
two different machines were measured with the following results (data 
in thousandths of an inch): 

Machine 1: 59 56 68 59 56 62 59 58 S57 56 
Machine 2: 57 55 59 56 S57 Sh Sh 5S? 55 5h 


Is there a difference between the mean diameters of the product of the 
two machines? 


From the given data, 
X, = 59.0, X= 55.8, #,° = 13.56, 8,° = 2.8h, 
and from (7) 


2 
(1.356 + .284) - 22 15.4 - 2 = 13.4. 


(1.356)° + (.284)* 
ll 11 


af = 








The steps in performing the test are then as follows: 


1. There ere two random variables X; and X5, the thicknesses of 
the wire from the two machines. We are assuming the populations are 
normally distributed and their variances unequal. 


2. The hypothesis to be tested is Hj: , = M2, against 
Hy: # Moe 


3- The test statistic is given by (6) which we assume has 
approximately a t-distribution with degrees of freedom given by (7). 


4. Choose the level of significance o= 0.05. 




















For 13 degrees of freedom the critical region is t = -2,160 
160, whereas for 14 degrees of freedom, it is t # -2.145 and 


5 


t 
2. 


wiv 


nd 2. 
2 2.145. 


6. Substituting the values found above in (6), gives 


t+ Dp 


es 59.0 - 55.8 on 2.50. 


13.56 , 2.84 1.26 
10 10 





Hence, we reject the hypothesis of equal means. 


Again, as in the case of Example 1, we could have conducted one- 
sided tests in the last three examples had the questions been suitably 
asked. Thus, in Example 2, suppose we had been interested in knowing 
whether or not the average hardness of the pins produced by machine 1 
was significantly greater than that produced by machine 2. Then we 
would be testing the hypothesis 


Hy: yy = Ko against (M7 Mo, 


and, at the .05 level of significance, the critical region would have 
been z 2 1.645. 


PAIRED MEASUREMENTS 


We have been assuming in all of the tests comparing two population 
means that the samples are independent. A special case when these are 
not independent occurs when the observations are paired. For example, 
we might be compering two different devices for measuring tensile 
strength in order to determine whether or not they give the same 
results. The values obtained would vary together from piece to piece, 
and, hence, would not be independent. In this case the test is carried 
out on the differences of the paired measurements assuming they form a 
population whose true difference is zero. The procedure is illustrated 
in the following example. 


e 5. Two analysts, "A" and "B", analyzed a series of 


chemical mixtures to obtain the percentage of one of the ingredients 
common to all of them. Following are the results: 


Mixture 1 2 3 & 5 6 7 _ 8 
Analyst Az 5 7 6 8 6 9 





Analyst B: 9 5 8 8 7 8 8 7 





Difference, as: -4 2 -2 Oo -il l -#l «2 


Are the analysts reading alike? 
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These are obviously not independent samples since each pair of 
readings was made on a different mixture. From the row of differences, 
we have then 


= 4; = -6, Za,? = 28; a: -.75, 





2p - (-6)° 


84 = —ooclicem = y 3-36 = 1.83. 


As in the case of the preceding examples, we have the following 
steps: 





1. Our random variable is d, the difference between the readings 
of the two analysts on the mixtures. 


2. Hypothesis to be tested is that the mean of these differences 
is zero, i.e., H,: KM, = 0, against Hy: Ma #0. (Ka is the mean of 
the population of differences.) 


3. The test statistic is 
Tpke. © 
84 


{nx 


which we assume follows a t-distribution with df = N-1:27. (Here N 
is the number of sample differences.) 


7) | 





he Choose a significance level A= 0.05. 


5. The critical region is then t $ -2.365 and t 2 2.365. 


6. Substituting the results found above into the test statistic 
of step 3, we have 


t= =75 - 0 e =e75 s -1.16 
1.83 -647 
Ve 


Since this value of t is not in the critical region, we assume the 
two analysts are reading alike. 





We have not included the operating characteristic curves for the 
tests described in this peper. They are given in [2/7 and [37, 
together with some examples of their uses, Table 12 in /17 includes 
power curves for some of the tests. The form in which it is given makes 
the table particulerly useful for some types of problems. 


Other methods of testing hypotheses of the form discussed in this 
paper are also available. Some of these involve use of the range 
instead of the standard deviation. Several alternate tests are des- 
cribed in /h7 and £57. The quality control engineer who is planning 
to use significance tests would do well to examine other ways of making 
them in order to determine which are best for his particular purpose. 


It should be emphasized again that significance tests are simply 
guides to help us make up our minds as to which course of action seems 
best to follow in a given situation, other things being equal. The 
quality control engineer must also consider the practical implications 
of the recommendations he makes to management. For example, a statisti- 
cal significance test may show that a new machine would increase our 
production rate by 10 units per day. However, the cost of buying and 
installing the new machine may be more than the added profit realized 
from the greater production. Significance tests prove nothing in them- 
selves, but they can be helpful in deciding what to do when we are faced 
with alternatives. 
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SIGNIFICANCE TESTS III = VARIANCES 


Fred C. Leone 
Case Institute of Technology 


To continue the discussion of significance tests, we now turn to 
tests on the variance. We have considered the basic thinking of tests 
of significance, the risks, the assumptions, and procedures. Further 
tests on the sample mean, X, were explained for both large and small 
samples. One of the points mentioned earlier in tests of significance 
on the mean of two samples was an assumption of equality of variances. 
That is, in a particular test of significance on sample mean Xx, versus 


sample mean X» it was assumed that the variances 3,” and on” were 


equal. This is equivalent to saying that the variances of the popula- 
tion from which each sample came were the same. We now proceed to take 
this assumption and treat it as a hypothesis which we test. 


Tests of hypotheses on the variance may be divided into two groups, 
namely, (a) sample versus population, and (b) sample versus sample. 
In the former the type of question we consider is whether this sample 
we are proposing has come from a normal distribution with a variance 
equal to some specified value. In the latter case we consider a ques- 
tion like the following: Could these two samples have come from normal 
distributions with the same variance? (Here we do not specify what 
this population variance is.) We may also include in this latter cate- 
gory tests on the homogeneity of several variances as well as tests on 
one of a set of variances being significantly larger or significantly 
smaller than the others. 


Consider the following example. Castings have been produced with 


a variance (o*) equal to four pounds. A new process has been proposed 
which is much more economical. This change will be adopted provided 
that the variance of this is the same as the proposed variance of four 
pounds. A sample is to be taken and a decision is to be made as a 
result of the test. 


Let us restate the procedure which was proposed in the introduc- 
Lory paper of this series. This is as follows: 


1. Choose the random variable and decide on the a priori 
conditions which are not under test but are considered known. 


2. State the hypothesis, Ho» together with the alternative 
hypothesis, H,- 

3. Specify the test statistic. 

4. Denote the sample size and the significance level. 


5. Determine the critical region. 


6. Take a random sample, calculate the value of the test statis- 
tic, and make a decision to either accept Ho or to reject Hos 
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The steps of this test for the example, then, are as follows: 
1. The random variable is the weight of the casting. The a priori 
assumptions are (a) that this variable is distributed normally 
with some unknown mean (uy) and unknown variance (o*) and (b) 
the sample is randomly chosen and its elements are independent. 
2. The hypothesis and its alternative are 


2 2 
Ko : 0 o, (= 4) 


2 2 
H, +0 # o, 
3. The test statistic is 


2. (N= 1)s* 
2 


o 
° 


x 


where s* is the variance from the sample and is defined as 
w\2 
z (X, X) 


Nel 


4. Let us specify a sample of size 6 and the significance level 
a@ equal to 0,05. 


$. The critical region is 


x” < 0.831 and x* > 12,85 


6. The sample, the test, the conclusion’ If 0,831 < re < 12.83, 


accept the null hypothesis (Hp). If x” < 0.831 or x” > 12.83, 
reject the hypothesis. 


In order to determine the critical region of rejection of the test 
it is necessary to make use of Table 1. This table presents the percen=- 


tiles of the test statistic, x". The x distribution is a family of 
probability curves. The members of the family change with different 
values of degrees of freedom, v. Here v is N-l. Figure 1 presents 
several of the curves of the probability distribution. Note how the 
curves tend toward a symmetrical distribution as N increases. The 


abscissa of these curves is x and the ordinate is f(x"). The area 
under each curve is, of course,equal to 1. Incidentally, the mean of 
each of these curves is v. 


Turning now to Table 1, the row designations are the degrees of 
freedom and the column headings, the percentages. The numbers within 


the body of the table are the values of < which give the designated 
probability to their left. For example, for v equals 5 degrees of free- 


dom, the value re = 0.831 has 2.5 percent of the area under the curve to 
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Figure l. ‘. Distribution for v = 2, 4, and 10. 


the left of it. The value re = 12.83 has 97.5 percent of the area to 
the left. (This is equivalent to 2.5 percent to the right.) If we 
wish our a to be 0.05, we can have 2.5 percent on each tail of the 
distribution. 


Let us continue the problem. A random sample is taken with the 
following weights of castings in pounds, as presented in a problem in 
the previous paper: 


49, 55, 53, 57, 48, 53 


From this we calculate s* #11.9. Finally 


ea Mads" 
3 
o 
° 


~ 50:9) « 14,9 


x" is then in the critical region, that is, 


x (= 14.9) > 12.83 
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TABLE 1 


PERCENTILES OF THE x DISTRIBUTION 





Per Cent 








N 0.5 1.0 205 520 95.0 97.5 99.0 99.5 
2 0.000 0.000 0,001 0,004 3,84 5202 6.63 7.88 
2 0.010 0,020 0.051 0,103 5.99 7238 9.21 10.60 
3 0.072 0.115 0.216 0.352 7,81 9.35 11.34 12,84 
4 0.207 0.297 0.484 0.711 9.49 11.14 13.28 14.86 
5 0.412 0.554 0.831 1.145 11.07 12.83 15.09 16.75 
6 0.676 0.872 1.24 1.64 12.59 14.45 16.81 18.55 
7 0.989 1,24 1.69 2el7 14.07 16.01 18.48 20,28 
8 1.34 1.65 2.18 2073 15.51 17.53 20.09 21.96 
9 1.73 2.09 2.70 3.335 16.92 19.02 21.67 23.59 
0 2016 2656 3225 3.94 18.31 20.48 23.21 25.19 


15 4.60 023 6.26 7026 25.00 27.49 30.58 32.80 
20 7243 8.26 9.59 10.85 31.41 34.17 37.57 40.00 
30 13.79 14.95 16.79 18.49 43.77 46.98 50.89 53,67 
40 20.71 22.16 24.43 26.51 55.76 59.34 63.69 66.77 
60 35.53 37.48 40.48 43.19 79.08 83.30 88.38 91.95 





We, therefore reject the null hypothesis. On the basis of the sample 
presented we have information to state that the new process, though 
more economical, is not producing a product which is equally as var- 
iable as the established process. Our sample points to a significant 
difference in variances. 


Let us stop to compare the meaning of rejection as opposed to 
acceptance. If we reject a hypothesis with a specified a, then we can 
state that we have a l=a probability that this sample did not come from 
the proposed population. On the other hand, if a hypothesis is ac- 
cepted, we simply state that we are willing to "go along" with it. We 
shall act as if it is true. In some instances we may have to state 
that we simply do not have enough information to negate the hypothesis. 


Sometime ago this analogy was presented to me: A result of a 
test of hypothesis is equivalent to the result of a law court. A man 
is either found guilty (rejection of hypothesis of innocence) or not 
guilty (acceptance of the hypothesis). In the former, we have enough 
information to convict him. In the latter, we either do not have 
enough information, if he is guilty, or he is really innocent. We do 
not know which is correct. 


At this point it is well to consider the operating characteristic 
(OC) curve of this test of significance. Recall that a is the prob- 
ability of rejecting the null hypothesis, Ho» when it is true. 8 is 


the probability of accepting the alternative hypothesis Hy when Ho is 


506 














true. For a sample of size 6 the critical values of x are 


x” = 0,831 and 12.83 
Our first step is to translate these critical values of x” to 
critical values of s*. By using the equation below, with s* unknown, 


we have for each x” 


(N - 1)s* 2 
———— 
o 
° 


Substituting for N, 3, and the critical value of x" we have 








2. = 12,83 
4 


This gives us the critical values of J, namely, 


2 
sy e665 


2 
So 10.26 
Suppose we now consider alternative hypotheses. For the hypoth=- 


9 
esis Hy : 0° = 9 (= o,*) the value of 8 (error of the second kind) can 
be calculated as follows: 


2 _ 5(.665) . 
x 3 2369 


2 6 
Xg = S(20526) . 5.700 


B is the probability that x" is between x" and Xo" for 5 degrees of 
freedom, that is, 


2 


P(.369 < x~ < 5.700) = .65 


This same procedure is continued for other alternative hypotheses, 
These results are presented in Table 2. The first column presents o; 
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TABLE 2 2 2 
CALCULATION OF OPERATING CHARACTERISTIC CURVE FOR H) to = 0% (= 4) 
FOR 5 DEGREES OF FREEDOM ° 


2, 5(.665) 2. §(10.26) - Play Paxrex,”) 








0, 0,” x . 2 og 

(1) (2) (3) (4) (5) 
1 3.325 51.30 65 
2 4 0.831 12.83 295 
z 9 0.369 5.700 65 
4 16 0.208 3.206 233 
7 25 0.133 2.052 +16 
. 36 0.092 1.425 08 
7 49 0,068 1.047 04 
- 64 0.052 0,802 .02 
9 81 0,041 0.633 Ol 





for the alternative hypotheses (o* . ab where 3,” is given in col= 


umn 2. Column 3 is the lower limit of < for the alternative hypothesis, 
while column 4 gives the upper limit. Finally, B is calculated in 


column 5. This is the area under the x” curve for 5 degrees of freedom 
between the two limits, x" and “ 


The results of Table 2 are presented in Figure 2. The abscissa 
is the value of o,* i 
is also stated. The ordinate is the error of the second kind (B) if 
the alternative hypothesis is true and Ho is accepted. For comparison, 


for the alternative hypothesis. A scale for o 


a second operating characteristic curve is presented by a broken curve 
for the similar problem with 15 degrees of freedom rather than 6. Many 
authors prefer to use 1-6 as the ordinate. This figure of the compli- 
ments of B is called the power of the test. 

In the previous example you will note that the test was two sided. 
We asked beforehand whether o” was significantly different than a”. 
Suppose, on the other hand, we ask whether o* is significantly greater 
than o,*. This is a one sided test, and our single critical value for 
the same sample size and a = 0.05 would be x = 11.07. If then 


x < 11.07 


we accept the null hypothesis. Otherwise we reject the hypothesis. 
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Figure 2. Operating Characteristic Curves for H) : o” = 3," (= 4) 
for 5 and 15 degrees of freedom. 


Let us turn now to the problem of comparing sample variances from 
two different populations. From each population we take a sample and 
determine its variance. Are these variances compatible? In other 


words, could these samples have come from normal distributions with 
equal variances? 


In setting up the general steps in testing a statistical hypothesis 
we have: 


1. The random variable is the measurement, X,> and the a priori 
conditions are again, normality, randomness, and independence 
of the values in each of the two samples, 

2. The hypothesis is 


2 2 
Hy $ o) o 


where o,” is the variance of the population from which the 


sample one came, and on. is the other variance. 


3. The test statistic is 


2 2 
F=s, / So 
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where s,” is the variance from sample one, and 85° is the 
variance from sample two. For this test we shall designate 
as sample one that sample which has a larger s*. Hence, we 
2 2 
y > &° 
use of the table. 


require s The purpose of this is to simplify the 


4. The sample sizes are respectively N, and N,. Consider an a, 


1 2 
say, a = 0.05. 

Let us again present an example which was proposed in the preced- 
ing paper. The breaking strength of two types of cloth was tested for 
a significant difference of means. The variances of the sample were 
assumed equal. We now take this assumption and test it as a hypothesis. 
The two samples of breaking strength in pounds per square inch (psi) 
are: 


Sample As: 61 52 45 46 
Sample B: 48 71 69 68 58 70 64 


If we calculate the variances from the sample we obtain 


s,” = 54 


8, = 69.7 


S oc = Dea 2 2 
Let us call Sp > and Ss, So in order that Ss) > Sos 
The particular steps in the test of significance then become: 


1. X is the breaking strength in psi. 


2 2 

2. Ho $ oy 7 oy 
2 2 

Hy ? O° # oO 


2 2 
3. Fes, / 85 


4. Sample (1) of size 7 (Sample B) 
Sample (2) of size 4 (Sample A) 


, riti is F > 
5. The critical region is F > F975, 6, 3 


6. The samples are taken and F is caleulated. If F < 14.7, 
accept the hypothesis that the sample variances are equal, 
lf F > 14.7, reject the hypothesis. 


To determine the critical values it is necessary to consult 
Table 3. In this we have the percentiles of the F distribution for 
4 degrees of freedom in the numerator, and Yo degrees of freedom in 


the denominator. The F distribution is a family of curves which has 
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two parameters, 4 and Vos The statistic F is the ratio of two sample 


variances where each of the samples was originally drawn from a normal 
distribution. In the table each cell of four values is made up of the 
values of F with 4 and Yo degrees of freedom, which give the probabil- 


ity as stated in each row. This probability is the area to the left 
under the F distribution. 


For a two sided test of hypothesis, the critical value of 
2 2 
F= 8, / 8, >1 


for a specified a is that tabular value which has a probability of 
l-a/2. The nature of F is such that by requiring that F be greater 
than 1 we need only consider an a/2 area on the upper tail. In 

Table 3, then, for v, = 6 and v, = 3, the .975 row gives us a value 
of 14.7. i 


To conclude this example, we calculate 


Fo 


= 1.29 


Since F<F one ¢ 3 (= 14.7), we accept the hypothesis of equal var- 
o= > > 


iance. As done in the preceding paper, we can then continue the test 
for equality of means. 


Note that the above was a two sided test. If, however, before 
actually taking the samples, we suspected that one variance would be 
larger and asked, for example, "Is variance B greater than variance A?® 
we would have a one sided test. The critical value of F would then be 
F 195, 6, 3 which is equal to 8.94. 


There is a further set of tests which should be mentioned at this 
time. These concern the homogeneity of several variances. Suppose k 
samples were taken and we wish to know if the variances were compatible. 
The hypothesis would then be: 


One of the tests commonly used is Bartlett's test. This can be found 
in most books on statistical methods. 


A test known as Cochran's test was devised to consider whether a 
particular one of the variance of the k samples is significantly 
larger than the others. This test is especially useful prior to the 
analysis of variance when it is suspected that a particular cell in the 
design does not have a variance compatible with the others. This equal- 
ity of variances is often assumed in the analysis of variance technique. 


511 








Recently a test was devised by R. Doornbos” to test the significance 
of the smallest of a group of variances. 


In conclusion, let us state that we have considered a comparison 
of a sample variance with a proposed population variance. The proposed 
test was supplemented by an operating characteristic curve. Recall, 
also, that if a hypothesis is rejected, we have a certain assurance of 
our action being correct. We had decided on a risk of rejection of the 
null hypothesis if it were true. On the other hand, if the hypothesis 
is accepted, we do not have this high assurance that it is correct. 

We simply go along with it. We can, however, compare the likelihood 
of this hypothesis with an alternative hypothesis by considering the 
operating characteristic curve. This same procedure of comparing a 
sample variance with a population veriance was carried into a compar- 
ison of two sample variances. Firally, it must be kept in mind that 
to simply test a hypothesis and agree or disagree with it is often not 
enough. We must continue our investigation and consider confidence 
intervals on our population parameter. If we reject a hypothesis, 

for example, the logical question to follow with is, "What about the 
true perameter of the population from which the sample did come? What 
can we say about that?" Here we get into the area of confidence in- 
tervals which is to be discussed in the next paper. 


*"Significance of a smallest of a set of estimated normal 
variances," by R. Doornbos, Statistics Neerlandica, 10, 
Noe 2, 117-26, 1956. 
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SIGNIFICANCE TESTS IV: CONFIDENCE INTERVALS 


Robert W. Boeke 
John Deere Ottumwa Works 


In the preceding papers we have been primarily concerned with deci- 
sion statistics, We have stated a hypothesis, selected a level of sig- 
nificance, calculated a critical region, then tested the hypothesis with 
the statistic calculated from our observations; based on the location of 
our statistic relative to the critical region we have either accepted or 
rejected our hypothesis. Our observations either substantiate or refute 
the hypothesis under test and we decide either to accept or reject it. 
It should be observed that we have utilized the statistics to test our 
hypothesis rather than to estimate the population parameters, Frequently 
our sampling may be performed to estimate a population parameter, 


If sampling is performed to estimate a population parameter, we may 
use either a point or interval estimate, For instance, we may draw a 
random sample from a population and use the_sample average as a point 
estimate of the_population average, Thus, X is a point estimate of ° 
However, since X is subject to sampling fluctuation we may, sapthouker’ 
for small samples, obtain a value which is misleading. The interval 
estimate is of such a nature that for a population parameter, ©), limits 
may be calculated (Lj and Lg) so that the interval may be expected to 
encompass the parameter with a probability of 1-«<, where ec is the prob- 
ability that the interval does not include the parameter. In general- 
ized form the interval may be written: 


L, <o x L, 
and the confidence coefficient is l-a, 


If a random sample of N items is drawn from a normal population we 
may estimate an interval which has a probability of, for instance, 95% 
of encompassing the population mean, 


L, < cj < lL 
and 95% of such intervals would be expected to include . The values 
Li and L2 are known as confidence limits and these limits define the 
confidence interval, 


We cannot say that ,« will be in the interval calculated but we may 
say that the probability is .95, before the sample is drawn, that the 
interval so defined will include « A rather fine distinction must be 
drawn regarding the statements which can be made, Since the parameter 
has a fixed value, it does not have a sampling distribution, Therefore, 
our probability statement must be made concerning the calculated inter- 
val, not the parameter, For instance, the statement, "The probability 
is .95 that j. lies within the interval", is incorrect since it infers 
that is &@ variable, We may, however, state that we are 95% confident 
that’ our calculated interval encompasses the parameter. Or, if we re- 
peatedly estimate the parameter in this manner, we may expect to include 
the parameter in our intervals 95% of the time, After the interval is 
calculated the parameter is either included or it is not; therefore, the 
probability that the parameter lies within the interval is either 0 or 
i. 
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Table I illustrates the wariable characteristics of the confidence 
intervals, Fifteen samples were drawn from a population with = 4O 
and o = 10, For each sample, 95% confidence intervals were Calculated 
(1) assuming o known and (2) assuming o not known, It should be ob- 
served that for this example ,~ is fixed and equal to 40, the confidence 
interval is the variable, If this experiment were repeated indefinitely, 
approximately 95% of the intervals calculated would encompass the mean, 


| tates 


Perhaps the biggest advantage of an interval estimate is that we 
not only get a point estimate of the parameter but also obtain a measure 
of the precision of our estimate, We may increase the precision, i.e. 
decrease the size of the interval, by increasing the sample size, Two 
samples may have the same average but the one with the smaller confidence 
interval indicates greater precision of estimation. A comparison of 
Tables I and II will give an idea of the relative precision for sample 
sizes of 5 and 25 respectively. Both tables were computed using 95% 
confidence intervals; the narrower limits in Table IT reflect the in- 
creased precision obtained by the larger sample sizes, 


Throughout the discussion, reference has been made to 95% confidence 
intervals, The confidence coefficient need not be 95% but may be selec- 
ted by the experimentor based upon the degree of confidence required 
under the specific conditions, Although values of 90, 95, and 99% are 
frequently employed, the individual should review his particular problem 
and need not feel hesitant to deviate from these values. 


A. Confidence Intervals Sew pro FY go known 


The sampling distribution of X has a mean equal to » & standard 
deviation, 7/s/W¥- , and is distributed according to fhe normal curve, 
The values of X are grouped about ,< in such a manner that the_interval 
p+ 1.96 &/-TW will include some known percentage, 95%, of X - values, 
Then the following interval will include the difference, ) po = X, 95 per- 
cent of the tine 


- 1.96 Sly <p - RA + 1.96 S/W 


By adding X to each term, we have 


T= 1.96 HAN << 1+ 1.965 /AlR- 


In the general case we have 
E+ cS s < I+ yew 


where 2} and 21.40 are obtained from Table IV,, and « is the risK we 
are willing to take that the interval may not include ,. . 


Example I, 

We wish to estimate the average tensile strength of a new run of 
twine, Previous tests indicate substantial variation in the mean tensile 
strength, .., but indicate that o is constant at 16 pounds. We wish to 
estimate ~ 90 percent confidence interval based upon a sample of 25 
observations with an average of 375 pounds, 
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Given: 
J7= 16 (known) 
= .10 or 1 -x= .90 


N = 20 
X = 375 
Then, 
ae - Se po & te ae 


We are 90 per cent confident that the interval, 369.7 to 380.3 lbs., 
will include the mean breaking strength of the twine. Our point esti- 
mate, or "best" estimate is 375 pounds but our interval indicates the 
precision inherent in our estimate. 


It should be observed that the confidence interval may also be used 
in conjunction with a test of significance, Had we tested the hypoth- 
esis that ~w = 370 with << = .10, we would not have sufficient reason 
to doubt that _»~~ = 370. Our critical region would include values less 
than 369.7 or greater than 380.3; since 370 lies within the confidence 
limits we accept the hypothesis that pr = pro® 370. 


It must be emphasized that results are defendable for small sample 
sizes only if the population is normally distributed. For large sample 
sizes the distribution of averages approaches normality regardless of 
the form of the parent population, To obtain tenable results, an ade- 
quate sample size hinges upon the amount by which the parent population 
deviates from the normal distribution and the variation in the data, 


B, Confidence Intervals for fone not known 


Where the population standard geviation, o , is known we have used 
the sampling distribution of 2 =X -, /°/-N Which is normally dis- 
tributed. However, we seldom know ao, Therefore, we use a test sta- 
tistic X -w / AWN which follows student's t - distribution, Although 
the t - distribution approaches the normal distribution for large sample 
sizes, it must compensate for the sampling fluctuation in both X and s 
which is significant for smaller samples, Since we want the confidence 
intervals to include a given percentage of the time, intuitively we 
acknowledge that for comparable conditions (i.e. sample sizes and risks) 
the confidence intervals tend to be longer where o@ is unknown than where 
co is known, 


Reference to Tables I and II will reveal that where co is known the 
confidence intervals are of constant length. However, with o@ unknown 
both the length and location vary and the average length of the interval 
is longer. The longer length of the interval does not mean that our 
estimate of ~ is any less precise; however, it reflects the lack of 
confidence in estimating ,. since we are relying on a sample estimate, s, 
rather than the parameter, co . 


Where o@ is unknown, the confidence interval for pn is 


X+ tho “IN <- 4. ¥% ty tec SAN 
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The values for t may be read from Table V for the values of «< 
selected and N-1 degrees of freedom, 


Example II, 
A new supplier of twine submits samples for tensile tests. We wish 


to estimate the populatign mean, = 90% confidence, Thirty 
samples are tested with X = 368 tnd s = 28, 


368 - Caarghige) <phK< 368 + (raha 
368 - 8.7 <s“~ &. 368 + 8.7 


359.3 <- << 376.7 


Our best estimate of the tensile strength of the new twine is our 
sample average, 368, and we are 90% confident that the interval 359.3 
to 376.7 actually encompasses the population mean, 


Here, again, we may observe the similarity between the confidence 
limits and the critical region used in tests of significance. We should 
also emphasize that this procedure is valid only where the population 
has a normal distribution, 


C. Confidence Intervals SER ping 2% =o: 7 is know 


= we wish to estimate the difference between two population means, 
D4! — o Shave Ge pepiatien variances are known and equal 
ae =F » the confidence interval is 


“Fat Bh VER, <P SRR the AOR 


Example III, 

Samples of twine are taken from two different runs and tested for 
tensile strength, Experience indicates that the means may be different 
but the standard deviation is known to be 16, We wish to estimate the 
difference between the means, ,.,+ and Ax, + of the two runs, The fol- 
lowing information is given 


Ny zs 25 
No =z 16 
%1 = 373 
X2 = 351 
31 = 14 
s2 = 17 


If we wish to be 99% confident that our limits se eaanee Pin Pe 
then Zu = -2. 576 and Z ,_ Ka* +2.576 (Table IV.) 


Then 
373-351 = (2.576)(16) |_1+ 1< ~M£373-351 + (2.576)(16) }_i+ 
| 25 16 Pi 25 i 
22 - 13.1 SA is < 22 + 13.1 


BI <p pee = 35 1 
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Our best estimate of the difference in strength between the runs is 
22. Moreover we are 99% confident that this true average is between 8,9 
and 35.1. 


By comparison with a test of significance based on the hypothesis 
that = /A~2 , we would have rejected the hypothesis since 0 does 
not Iie within the confidence interval, As in Section A, the validity 
of the test depends upon the normal distribution of the populations for 
small sample sizes. 


D. Confidence Intervals for wn, —M2 7% * 2 = @€ is unknowm 


Where @ is not known, the test statistic must rely upon the t - 
distribution rather than the normal distribution, For estimating the 
difference between two means where the standard deviations are assumed 
equal, but unknown, the confidence limits are: 


Kika tt yee “PV, SA Me 6 KK tte Peg 
where tio< and t1.4,are determined for Ni + No - 2 degrees of freedom 
Sp is the pooled standard deviation and is equal to: 


Sp = (M, = 1) 8,2 + (Mp = 1) 80° 
Ny + Mp -2 








Example IV, 
Assume the same information as is given in Example III but assume 
that the standard deviation, o@ , is unknown. 


= (24) (196) + (15)(289) 
*P \ gor 


= 15.2 





Then, 


373-351-(2.70)(15.2) + ie 73=351+(2.70) (15.2) + 
\# AHP Nee 


22 - 13.2 <pa,- my < 2 + 13.2 
BB KM pre < 3522 


The results are comparable to those of Example III, However, it 
should be observed that the reduced estimate of the standard deviation 
(15.2 as compared with 16,0) is more than offset by the increase in 


E. Confidence Intervals for Paired Comparisons 


If tests are conducted so that for each condition or level of test- 
ing, observations are obtained from each of two different populations, 
then paired and the differiences computed, we may use the following 
method of calculating confidence intervals: 


Xa ttn ad/JN SAA MS KA + tpg 44/IA 
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where Xa is the average of the differences, sq is the standard deviation 
of the differences and N is the number of differences, not total obser- 
vations, Also + and t _ are obtained for N-1 degrees of 

freedom, fae IYer 


Example V, 

We wish to estimate the average difference in tensile strength for 
two brands of twine so that we may be 95% confident of including the 
true difference, 


Two brands of twine are tested for tensile strength under various 
weather conditions, In each instance a test is made of each brand. 
Tests are made under five different conditions with the following results: 


Test No, Brand A Brand B Difference 
1 376 349 +27 
2 362 342 +20 
3 320 305 +15 
4 345 322 +23 
5 384 369 +15 


Then Xq = 20, sq = 5.2, N= 5, thee = -2.78 and ty 4,. = +2.78 


20 - (2.78)(5.2) / 5 < Ad <20 + (2.78)(5.2) /-]5 
20 - 6.5 <A < 20 + 6.5 
13.5 <a ¢ 26.5 


We are 95% confident that the limits, 13.5 and 26.5 encompass the true 
difference, 


As in sections B and D the t- distribution is restricted to cases 
where the population is approximately normal, Where we pair observations, 
this limitation will be fulfilled if each population has a normal distri- 
bution and the effects, under various conditions, are additive. 

F, Confidence Intervals for a” 


The confidence limits for 7* are based upon the X*/d.f, distri- 
bution, The ratio s*/o* is distributed according to X*/d.f.. If we 
let Phoc and Py +o represent the values for X2/d.f. for various degrees 
of freedom, then, 


Phos < el/aore < Pi toc 
By inverting and multiplying by s“, we have 
22 e s*> 
which are the confidence limits for 72 e 
Example VI, 


In example II we investigated a new brand of twine and estimated 
its tensile strength. Using the same data (X = 368,s = 28, and N = 30) 


522 











let us calculate the 90% confidence limits for 72 . 
734 z 784 
610 dint 1.45 


1286 po 2? 540 


We are 90% confident that the limits 540 and 1286 encompass the 
true value of a2 , 


By taking the square root of each term we obcain confidence limits 
for @ e 


H.2 PT > 23.2 


The confidence interval for @ is interpreted the same as that for 
Oo but may have the advantage of being somewhat more understandable, 


When using the X*/d.f, distribution we should be confident that our 
population is normally distributed, 
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Table I 


95% Confidence Intervals for 
with N = 5 drawn from a population with ~ = 40 and 
a2 = 100 


(1) oa“ known = 100 











































































































Table II . 
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2 ~ known = 100 
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Table III 


95 Confidence Intervals for 7 
Drawn from a population with ~~ =40 and co = 100 


(1) N= 5 


30 | 


20 7 


















































(2) Ns 25 











Table IV Areas Under The Normal Curve 











-00 To Z 

z AREA 
=3.090 001 
-2.576 2005 
13 as 
-1.645 2050 
-1,282 2100 
= 842 2200 
= 253 -400 

© 2500 
+ .253 -600 
+ ,842 ~800 
+1,282 2900 
+1645 2950 
+1.960 0975 
+2.326 990 
+2.576 0995 
+3.090 999 
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Table VI, Values of X*/d.f, 








Degrees 
of P f B Pr a F 
Freedom 2005 2025 2050 2950 2975 2995 
1 200004} .001 -004 3.84 5.02 7.88 
2 2005 025 2051 3.00 3.69 5.30 
3 -024 .072 e117 2.60 3.12 4,28 
4 2052 2121 -178 2.37 2.79 3.72 
5 -082 2166 2229 2.21 2.57 3.37 
~ 2113 2206 2273 2.10 2.41 3.09 
7 2141 o241 2310 2.01 2.29 2.90 
8 2168 2272 2342 1.94% 2.19 2.7% 
9 2193 2300 2369 1.88 2.11 2.62 
10 2216 0325 2394 1.83 2.05 2.52 
12 2256 367 435 1.75 1.94 2.36 
14 2291 2402 469 1.69 1.87 2.24 
16 2321 ~432 -498 1.64 1.80 2.14 
18 348 -457 -522 1.60 1.75 2.06 
20 2372 -480 2543 1.57 1.71 2,00 
30 -460 -560 -616 1.46 1.57 1.79 
40 -518 611 -663 1.39 1.48 1.67 
60 592 675 -720 1.32 1.39 1.53 
120 699 763 -798 1.22 1.27 1.36 
oo 1,000 1,000 1.000 1.00 1.00 1,00 
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A DISCUSSION ON ROTATABLE DESIGNS 


J. S. Hunter 
Princeton University 


Introduction 


Recently a special class of experimental designs called "rotatable" 
designs has been proposed as being in some sense "best" when it is the 
purpose of an experimenter to explore an unknown response function using 
the techniques first proposed by G.E.P. Box. This paper is an attempt 
to describe what a rotatable design purports to do. 


Some Background Material 





Let us first assume that the errors made in repeatedly observing 
the response py at the jth point in a factor space, are independent 
and Normally Distributed with a mean equal to zero and a variance o*. 
Then 


Yiu = wit €iu u= i. 2, sees N) (1) 


Further assume that as we move from point to point in the factor space 
the errors remain Normally Distributed with mean zero and variance o?, 
and that between any two distributions of errors the covariance is zero. 


Statistics are often formed from linear combinations of such ob- 
servations. By “linear combinations" we mean that the statistic is 
calculated using a formula of the type 


@1Y1 + 82Y2 + --. + OnYy (2) 
where 4, &2, ---, % are constants, and the observations y,, Ya, ---. 
yy &ppear only to the first power. For example, the average VY is 
such a statistic since 


1 1 1 
gy = © = on + yet... + om (3) 


The variance of a statistic such as that shown in general in 
equation (2) is given by the formla 


afot + aBoB + ... + affo# (4) 
and since the variances are all equal, by the formula 
(af + ab +... + ay )o* (5) 
The variance of the average y is therefore 
2 2 as lie o® 
Vv = — + + coe + o = N a 6 
(y) (2 xz 2 (F2)0 i (6) 


Of course in practice the true variance of the errors, o*, is not know 
and must be estimated by calculating the statistic s® where 


N 
sf = (ya - ¥)*/(N - 2) 


Having calculated a statistic, and having estimated its variance, 
the t distribution can now be used to establish a confidence interval, 
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by substituting in the expression: : 
Statistic + t (estimated variance of the statistic)* (7) 


where the value of t depends upon the probability level at which the 
confidence statement is to be made and upon the degrees of freedom enter- 
ing the estimate of the variance. Thus a confidence interval for the 
mean yp is given by 

¥ + |. (8) 
We now adopt the attitude that all postulated values of yp that lie 
within this interval are not contradicted by the data and are hence 
“acceptable” and that all postulated values of wu lying outside the 
limits are "unacceptable" or "significant". By "unacceptable" or 
"significant" we mean the probability that the given observations could 
have come from a population having this postulated value of yp is so 
small as to compel us to declare the postulate unreasonable. 


Simple Linear Regression, lst Order 





Let us now suppose that an experimenter is studying the relation- 
ship between some response variable 1 and some simple controlled 
variable x. Initially nothing may be known about the relationship, 
the variable 1 being simple an unknown function of x, i.e. 


7 = f(x) (9) 


As a first step in exploring this association between and x, it 
is frequently assumed that the response 4 is a simple straight line 
function of x, i.e. that the relationship between 1 and x can be 
adequately represented by the first order model 


1 = Bo +x (10) 


where 6, is the intercept of the line and 6 is its slope. Since 
the observed value of the response y will never exactly equal 14, the 
mathematical model actually used is 


yY = Bo +Bx+e (11) 


For example suppose the following observations (the y,;) are 
recorded at the indicated settings of the controlled variable x, and 
that it is the decision of the experimenter to fit the first order model 
y = Bo +B6x+e€. We then have 


Controlled Variable Response Variable 
= zx 
2 8 
3 7 
4 5 
5 3 
6 1 


The least squares estimates of the coefficients B, and 6B are obtained 
from the following familiar formulas 
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sxy - (2)(2y) Sxy 73 - (20)(24) 18 





N 
b(the estimate of 8B) = =— = es ee 
Ix2 - ea t= 9) Sx® 90 - aa 10 


bo(the estimate of B,) = ¥ - bx = oe -b a = = - -1 8) $20) = 12.0 


The fitted equation is therefore, 
Ff = 12.0 -1.8 


where 5 stands for the predicted value of y at any chosen setting of 
x. Since b, = ¥ - bX another convenient way of writing the fitted 
equation is 


n 


y 
i.e. $ 


¥ + b(x - X) (12) 
4.8 - 1.8(x - 4). 


The least squares fitted equation guarantees that the sum of squares 
of the quantity (predicted responses - observed responses) is a minimum, 


N 
o 2 
i.e. that 2 (91 - y,)* is a minimm. 


The predicted values of the response calculated from the fitted equation 
are: 


Controlled Predicted Observed 
Variable Responses Response 


x F y ($-y) ($-y)* 

2 8.4 8.0 0.4 0.16 

3 6.6 7.0 -0.4 0.16 

4 4.8 5.0 -0.2 0.04 

5 3.0 3.0 re) 0 

6 i.2 1.0 0.2 0.04 
0.0 0.40 


We observe now that <X(j-y) equals zero and that £(f-y)® = 0.4o. 


The Analysis of Variance based on the first order model is: 





Sum of Degrees of Mean adtice 
Squares Freedom Square 
Total Crude Sum of Squares = Zy* = 148.0 a 
2 
Sum of Squares absorbed by v= = 115.2 . 
Sum of Squares absorbed by b = bSxy = 32.4 1 32.400 F=ok3** 
Residual Sum of Squares (by subtraction)O.4 3 0.1333 = s* 


As a check on the calculations we note that the residual sum of squares 
in the Analysis of Variance Table is equal to <X(f-y)® = 0.40. We ob- 
serve further that the estimate of o?, the variance of the individual 
observations, is given by dividing residual sum of squares by its degrees 
of freedom, i.e. s® = 0.1333. 
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It should be remembered however that the difference between the pre- 
dicted values ? and the observed values y will measure experimental 
error and contributions caused by a choice of the wrong mathematical 
model. In this example it is assumed that the differences (f-y) are 
due to error alone. \ecepting this estimate of s* as a valid estimate 
of o* an F test of the mean squares associated with the coefficient b, 
with one and three degrees of freedom, is found to be highly significant 
and we conclude that the fitted model is an adequate representation of 
the unknown relationship y = f(x) over the range of x studied. 


A plot of the data and of the fitted first order model y = 
12.0 - 1.8x is given below. 












bf 
A 
6 
T Figure 1 
Response € 
Variable 5 
y 4 | = 12.0-1.8x 
3 ‘ 
2 95% Confidence a’ 
1 Interval about } 
1 4 i i | > x 
1 2 3 4 5 6 7 
a ! cede ! n — 
-3 -2 -1 0 1 2 3 


Now, as mentioned earlier, the confidence interval about any 
statistic formed from a linear combination of the observations, is given 
by the formla 





statistic + t./variance of the statistic 
The quantity z computed from the fitted equation for any value of x 
is such a statistic and we proceed now to construct a confidence 
interval about 7. 
Since f = 7 + d(x - X) the variance of F is 


v(}?) = Variance (7) + (x - ¥)* Variance (v)* (13) 


“— V(F) = v(F) + p°V(v) (14) 





" We note here that the x's are controlled variables assumed fixed 
without error and have therefore no association with the errors made 
in observing the y's. Hence the quantity (x - ¥) is simply a 
constant akin to the constants "a," given earlier in equation (2). 


where op is the distance between the chosen value of x and the average 
3 If we code the x's so that X-=0 we may write 


V(?) = V(b.) + e*v(b). (15) 


Thus a confidence interval about § for any setting of x is obtained 
using equation (7), that is, by calculating 


¥ +t /v(F) + p*v(d) . (16) 
o? ‘ o2 o@ 
where V(J) = 7 «(and we are given that V(b) = —s* = aa, >)" 





Of course in practice the variance o* is estimated from the data. 
In the yyy at x =2, i.e. at op = (x - X) = the predicted 
response is = 8.4. The OFA, confidence interval ain * the statistic 


is 





1333 2 0.1333 
8.4 + eS + (-2) ae 


or 8.4 + 3.18(.28) or 8.4 + 0.9. 


We can now obtain a confidence interval for 5 for all values of 
Xe \ plot of these intervals is also shown in Figure l. We note from 
this figure that the variance of y; and hence the confidence interval, 
is smallest when p = (x - ¥) = 0 and symmetrically increases as po 
becomes larger. 


Linear Regression, end Order 





Of course the experimenter may find that the first order approxima- 
tion to the unknown response function is inadequate. One measure of the 
adequacy of a fitted model is to compare the quantity 2(f - y)®/(N - 2) 
against some measure of experimental error variance that is independent 
of the fitted model. Such measures are obtained by replicating obser- 
vations at one or more settings of x. If the error variance calculated 
from replicated values is significantly smaller than that calculated by 
taking <(f - y)® then the model is inadequate. In the event a first 
order model is found inadequate, or when the experimenter feels from the 
start that the response y gives a curve when plotted against x, a 
second order model can be fitted to the data, i.e. the model 


Y = Bo +Bixi + Baxi + € (17) 

Some slight changes in subscript notation are to be noted. The con- 
trolled variable now appears as x, and its associated coefficient as 
By. The double subscript of 6), indicates that it is a coefficient 
associated with the second order term x5. We cannot take the time here 
to show how to estimate the three coefficients Bo, B, and 6y,. # The 
calculations are a bit lengthy (but not difficult) and are fully descri- 
bed in several references (1), (2), (3). The fitted equation then is 


$ = bo + daxy + Dyaxr (18) 
The variance of ¥ is 
V(¥) = V(b) + p*v(b,) + pv(bax) + 2p? cov.(bp, br) (19) 


where p is once again the distance from X¥ to x. This formula for 
vV(F) requires a few comments. First, the values for x entering the 


535 








calculation must be coded so that <x; =0, i.e. so that F=0. 
Secondly we observe that there is a covariance term occurring between 
Do and by,- This term will always be present since the estimated 
coefficients b, and b,, as they appear in this model can never be 
completely uncorrelated. References (1), (2), (3) tell how to compute 
these variances and covariances. Knowing V(¥) we can now proceed to 
establish a confidence interval about a calculated at different values 
of x. Once again the size of V(¥) and consequently the size of the 
confidence interval is smallest when p =0, i.e. at the point xX and 
becomes progressively and symmetrically larger as the distance po = 

(x - xX), plus or minus, becomes larger. The important point to note 
about equation (19) is that variance of a prediction made at any point 
x is solely a function of p, the distance from the center of the x 
coordinate system. 


First Order Rotatable Designs 





Suppose now that a response y is an unknown function of two con- 
trolled variables x, and xa, that is 


y = f(x), x2) (20) 


For example, the response may be the yield of a chemical batch process 
and the two controlled variables the time and temperature of the reaction 
As in the one dimensional case, the first step in exploring this function 
might be to fit the first order model 


y = Bo + Bax, + Baxa +e (21) 


to the observations recorded at the different settings of the controlled 
variables x, and xo. 


The most popular experimental design for obtaining estimates of the 
coefficients in this first order model is the two level factorial design, 
i.e. the 2X factorial design with k = 2. This experimental design 
requires that each controlled variable be fixec throughout the experi- 
mental program at a high level and at a low level, and that all. possible 
combinations of high levels and low levels of the variables be run. For 
example, in studying the relationship between the percent yield of a 
process as a function of time and temperature the two level factorial 
design would take the form: 





Controlled Experimental Response 
Variable Levels Design Levels % yield 
Time Temp. Xj X2 y 
l hour 240° «2 -1 ¥= 1 
5 2ho° 1 -1 yYe= 6 
1 280° -1 1 ¥3 = 7 
5 280° } 1 yi, = 18 


The mathemetical model is fitted to the experimental design levels 
rather than the actual levels of time and temperature. The coding 
mechanism associating the design variables x, and xa with time and 
temperature are 


x, « ‘timeinhours-3, | _ temp °C - 260 
2 2 alts 20 





The four experimental design points when plotted in the x, xo 
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coordinate system form the vertices of a square as illustrated in 
Figure 2. 








X2 
2 1.0% e 
aaail a 
1.0 ro * 
* -1.0 a & 


Figure 2. Coordinates of the 2° Factorial Design 


Estimates of the coefficients in the mathematical model are given by 











= 31 +*+% + Yt Y 5 
% 2 — rn 3 . ; thus b, = = = 8.0 
b "%y * Yo ~ ¥3 + My; thus by = 16 = 4.0 
s * ~ a 
TF, “Fo * 7 +7 
be a i a = 3 thus Vo = 8 = 4.5 
The fitted model is 
J = by + bax, + baxe (22) 
i.e. § = 8.0 + 4.0x, + 4.5x0 (23) 
Now the confidence in any value of 7 predicted at some setting of x} 
and Xe depends, as shown, on the variance of S. The variance of y 
at any setting x*,, xe is 
V(¥) = V(by) + xZV(ba) + xaVv(b2) (2k) 


The reader should be able to show quickly, using equation (5) that 
V(bo) = 07/4; V(b,) = 07/4 and V(be) = 02/4. Thus 


A 2 
WF) = Ft. + xf + xd] (25) 


Using the Pythagorean theorem the distance out to any point x,, xo is 
given by po = hee + x8 so that equation (25) can be rewritten as 


2 

VF) = Fl + 07] (26) 
Thus for the 2° factorial designs the variance of the predicted value 
y at the point x,, x2 is a function of p*, the square of the 


distance to the point. Thus all predictions made at a constant distance 
from the center of the design have the same variance. 
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The condition that v(¥) be a function of the distance p is a 
necessary but not a sufficient condition to show that the ek factorial 
and fractional factorial designs are rotatable first order designs. For 
a design to be rotatable one other condition must be satisfied, namely 
that the variances of the coefficients must remain unchanged as the design 
is rotated. For example, suppose we rotate the 2° factorial design 
given in Figure 2 through an angle 9 as illustrated in Figure 3. 





X2 
A 
1.0+ 
* 
‘ 
‘ 
‘ 
\ 
‘ 
\ 3 
‘ a 
‘ 
+, 
SS ‘| - L > Xz 
-1.0 _— se 
o ‘ 
-* ‘\ 
aie: ‘ 
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\ 
‘ 
‘ 
‘ 
& 
-1.0 + 





Figure 3. A 2® Factorial Design Rotated through an Angle o 


What are the variances and covariances of the estimated coefficients when 
the observations are taken at these new points? It can be show (ref. 4) 
that these variances remain unchanged, that is, V(b.) = o@/4, vV(b,) = 
o7/4 and V(be) = o®/4 regardless of the orientation of the design. 

In addition to the fact that the variances remain constant we have seen 
that simultaneously v(f) depends on po. Designs which meet these 
conditions are called rotatable designs. The 2* factorial and frac- 
tional factorial designs, when used to estimate the coefficients in the 
first order model 





k 
~ = 
y = Bo + ,2,8,x, + € (27) 
are rotatable first order designs. A very large class of rotatable 
first order designs exist, with the 2k fractional factorial designs 
being, by far, the most widely used. 


Second Order Designs 





Very often the attempt to represent the unknown response function 
y = f(x,, x2) by a first order model is unsuccessful and the experi- 
menter then often attempts to fit the second order model 


2 
Y = Bo + Byxa + Boxe + Baixa + BooxS + Biaxixe +e (28) 
The many uses of this second order model are mentioned in the references 


(1), (5), (6). 


Let us first discuss an experimental design which is not rotatable 
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but which will provide estimates of all the coefficients in this model. 
The design in mind is the three level factorial design, i.e. the 3k 
factorial where k=2. The design requires that each of the controlled 
variables be held at a high (+1), middle (0) and low (-1) level, and 
that all combinations of levels and variables be run. The coordinates 
of a two dimensional, three factor factorial design and a diagram of the 
design are given below 


x2 xy X2 
r | 1 
0 1 
-1 1 
a .@) 
0 0 
-1 0 
1 -1 
0 -1 
-1 -1 





Figure 4. Experimental Design Levels of a 3" Factorial Design 


We note that the origin and scale of the design have been chosen so that 
rx; = 0. If the response y is now recorded at each of these nine 
points there will exist sufficient numerical evidence to estimate all 
the coefficients in the second order mathematical model equation (28) 

to give the fitted equation 


A 2 2 
Y = 09 + byxX, + baxe + byyX, + beaxa + byax\xe (29) 


Now it can be shown that 


2 2 
V(b9) = 2S V(ba2) = V(bae) = <3 
o* of 
V(b;)} = V(ba) = a3 Voie) = F (30) 
2 
Covariance(b,,b,;) = Covariance(b,,be2) = - = 


The variance of a for the second order model is 
2 2 4 4 
V(¥) = V(b.) + V(by)xz + V(b2)xd + V(baa)xa + V(be2)x2 


22 2 2 
+ V(by2)x1x3 + 2 cov(b,,b,1)xy + 2 cov(b,,b22)xa (31) 


Substituting in the values of the variances and covariances of the 
coefficients and-collecting terms gives 


WF) = oF ( 2 - Axi + xB) + ext + Ok + xixz) | (32) 


. quick look at this equation for v(f) should convince the reader that 
the variance of ? at any point x,, x2 is not simply a function of 
the distance from the center of the design po but very much depends 
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upon the specific setting of x, and xg. When this occurs we can 
quickly declare the design is not rotatable. 


It is also interesting to observe what happens to the variances and 
covariances of the coefficients as this three level factorial design is 
rotated through an angle ». For example, if o = 45° then the design 
given in Figure 4 becomes that shown below in Figure 5. 


Xi Xe2 
ro 1.414) 
-0.707 0.707 
“1.414 © 
0.707 0-707 
° 18) 
-0.707 -0.707 
1.414 0 
0 “1.414 
0.707 “0-707 











Figure 5. Experimental Design Levels fora 3" Factorial Design 
Rotated through an Angle of 45 


In this particular orientation the variances and covariances of the 
coefficients become: 


2 
V(b.) = =; V(ba2) 


V(b22) = oo", 


v(by) = V(b2) =; — vV(baa) = 0°; 


i 
Covariance (b,, by,) = Covariance (bo, bea) = . 
=e (33) 
Covariance (b,1,b22) = ie 3 


Clearly the variances and covariances of several of the coefficients have 
changed and an additional covariance term, which was zero previously, has 
now appeared 


Rotatability 


We see therefore that if the v(F) at a point x,, Xg@ changes 
as we move from point to point even when the points are all the same 
distance from the center of the design, the design is not rotatable. 
Viewed another way, if the variances and covariances of the coefficients 
in the model change depending upon the orientation of the design, the 
design is not rotatable. 


In attempting to explore an unknown response surface using either a 
first or second order model, the experimenter will not of course know at 
which point in the experimental region his interest will ultimately lie. 
It would seem unfortunate then to use an experimental design which 
provided very small v(}) at one point (and hence narrow confidence 


540 


intervals) and large v(f) at some other point (and hence wide confi- 
dence intervals) even though both points were equidistant from the center 
of the system. Certainly such a design provides the experimenter with a 
rather astigmatic view of the response surface. However, if the experi- 
menter uses @ rotatable design the vV(f), and hence the confidence in 
any prediction, will be a function solely of how far the point of interest 
lies from the center of the design, the particular direction in which the 
point lies having absolutely no influence. 


The 3k factorials and fractional factorials do not provide second 
order rotatable designs, nor do the ek factorials. The 3* factorials 
are however rotatable first order designs. This is quickly shown since 

a 
if y = b>, + byx, + baxe then 

2 2 2 2 
v(¥) = §- + Fa + $2 = F1 + p®] 
and once again the variance of § at any point would be a function solely 
of the distance of the point from the origin Of the design. A full des- 
cription of the theory underlying the construction of a rotatable design 
is given in ref.(7). 








4 Rotatable Second Order Design 





Let us now take a rotatable design such as the hexagon design with 
the center point replicated three times 








x2 
A 
eo © 
1.0 F Xx X2 
T 1.732 o 7 
* = 1.500) 
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| 0 o J 








Figure 6. The Hexagon Design 


Note that the x's have been coded so that <x; = 0, xt = 9, the total 
number of points. The fitted equation would, once again, be of the form 
given in equation (29). The variances and covariances of the coeffi- 
cients are 


2 2 
V(b) = $5 V(bia) = Wibae) = 2py 
2 2 
V(bi) = v(be) = £3 v(die) = 305 (34) 
-o* o2 
Cov(b,,b12) = cov(b,,b22) = > cov(b33,b22) = si 


Thus, using the second order model we get for the variance of 5 
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VG) = 0 F + xa + xe) + xt + xd + Qxtxz) ) (35) 


or, letting p* = x? + x3 
— ft eer > oh 
vif) = oF { z+ 50° + He ) (36) 


Here we see that the variance of any predicted value is a function solely 
of how distant the point is from the center of the design. Furthermore, 
it can be shown that the variances and covariances of the coefficients in 
the second order model do not change when the design is rotated through 
an arbitrary angle » (ref.7). Hence, the hexagon design is a rotat- 
able design. 


Conclusion 


Early in this paper it was shown that if we fitted the simple first 
order model y = 6, + Bx + € (and took the precaution to code the x's 
so that =x = 0) then the variance of a prediction made at a distance p 
from the point x =0O is 


V(F¥) = V(b.) + p*v(b) 


If a first order rotatable design is used to fit the first order 
model 


k 
y = By + 2 Bix + € 
then regardless of the number of controlled variables x,, x2, ..-, X,, 


it is similarly true (provided that each x; is coded so that <x; =0 
and <xj =N, the total number of points) that 


V(¥) = V(b) + p*v(bz). 


In this instance p is the distance from the center of the design to 
the point (x,, x2, .--, X,) at which the prediction is made, i.e. 





2 2 2 
Pp = Xz + XQ t+ o-- + Xp 


Similarly it was shown earlier that if the second order model 
y = Bo + Bax, + Baix, is fitted then 


V(F¥) = V(b) + p2v(dr) + pV(bia) + 2p? cov(by,ba2). 
If a second order rotatable design is used to fit the model 
e k k 
y = By + 2Byx; + 2 jPP as + ippPis*s Ss 
then regardless of the number of variables x,, xa, ..-., xX, it is 
similarly true that 
v(¥) = V(b,) + p*V(», ) + ov(b33) + 2p* cov(b,,b;;)- 


It is this element of simplicity, i.e. the ome to one correspondence 
in obtaining v(?#) using regression equations with one or with k con- 
trolled variables, that identifies as unique the rotatable designs. 








(1) 


(2) 


(3) 


(5) 


(6) 
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PRODUCTION RELIABILITY 


Richard A, Carlson 
Napco Industries, Inc. 


The question of Reliability has received mch attention in engi- 
neering design groups, QC staff groups, component evaluation groups, 
but the department at the end of the line "Production" seldom is con- 
sidered. Much effort has been expended in some areas to achieve 
Reliability in production on specific assemblies or a product. Little 
effort has gone into the general application of increasing Reliability 
in production. Due to the nature of the beast, concentration in 
specific narrow areas won't raise the overall levels of Reliability. 


Production Reliability can be achieved if the desire to improve is 
implanted in all levels of production management. It is not just an- 
other technique like QC. It cannot be delegated to a staff assistant. 
By themselves production engineers, QC engineers or the janitor cannot 
insure the success of the program. But all levels from the production 
workers through the foreman to the plant manager mst have the Relia- 
bility religion. The staff groups servicing production must also be 
infused with this same fervor. Then you can expect tangible results 
from reliability applied to production, 


The answers to three questions will supply us with ideas necessary 
to apply reliability to production. Basic understanding by everyone as 
to » Where and How of production reliability is vital to the success 
of the program. ~— 


Wy 

For those engaged in strictly military work part of the why is 
obvious. More and more contracts are specifying a reliability prob- 
ability. The application and translation of this probability into an 
equipment test program can be quite difficult. The increasing complex- 
ity of systems and equipment is another factor involving Dr. Robert 
Loesser's famous cry "p overall". High maintenance costs coupled with 
high failure rates bring the customers, either commercial or military, 
screaming back to the sales department demanding some changes. Equip- 
ment failure at critical times never makes a customer happy. The loss 
of a mission, lives or equipment brings a very serious picture into 
focus. 


These comments are intended to show the need for increased relia- 
bility. This is applicable to all areas but intended to be used here 
as reasons for the need of increased production reliability. A need 
must be shown to people before they become interested enough in a 
problem to take action. 


The notation "p overall" is a concept that is very important in the 
understanding of where and how production reliability can best be 
achieved. The probability p overall is merely the product of the prob- 
abilities of all of the individual components. This meaning is ex- 
pressed in terms of the end product performance. An example of a 
p Overall of 80% in a unit of 00 components allows only one failure in 
1800 units of each of the components, Thus the understanding of this 
concept is necessary before proper steps can be taken to increase 
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production reliability. This concept has an effect on the method and 
direction taken in attacking problems, to the extent even of the 
sequence of the problems. 


Where 

The problem of where to look for production reliability begins with 
a definition. The normal definition of reliability is “worthy of con- 
fidence." This applied to production brings to the focus that, what is 
built mst be worthy of confidence. This begins with the state of mind 
of all personnel involved. Everyone mst be convinced that the products 
being built.are to be the best of everyone's effort. The product is the 
best design, built with the best tools by the most highly skilled 
craftsman in the industry. This feeling must permeat all personnel. 
Thus the definition of production reliability is "a state of mind." The 
attitude of all personnel towards workmanship or craftsmanship, a pride 
in doing a fine job is a prerequisite of production reliability. 


The discussion on workmanship leads us to other areas. It is nec- 
essary to understand the team concept of working together. The team 
consists of all personnel in engineering and production as well as staff 
groups assisting. The line worker mst feel the sam pride as the 
engineer. Each mst feel that the other is a necessary part of the 
team, with the product as the common bond. 


Before reliable production of a product can occur, there mst be a 
reliable product. The design must be such that it results in a highly 
reliable product, in short the engineer mst have done an excellent job. 
The problems of reliability barriers, component evaluation, component 
variabilities mst have been solved with the best compromise. The re- 
liability deficit between growth of equipment complexity and growth of 
component reliability is growing wider. A reliable design must come 
first. This design and its components must be tested to failure. The 
reliability boundaries and safety margins mst be checked and evaluated 
again and again. Due to our friend p overall the usual concepts of 
specifications are not good enough. The interaction of component var— 
liability, variables in adjustment and alignment, interaction of assem- 
blies grow more complex each day. The state of the Art is not yet 
mature in these areas. 


Even as a well tested reliable design arrives in production new 
yardsticks must be used in processing and manufacturing. The usual 
techniques are not adequate. The ordinarily accepted QC concepts are 
not adequate. Processing cannot be compartmentalized. Effects of a 
later process can negate satisfactory early processes. Careful selec- 
tion of components can be ruined by later immersion in hot solder or 
paint drying ovens. Care must be exercised in handling and in the at- 
titude towards errors. Each person should feel that the best is none 
too good. If an operator finds an error missed by inspection or other 
workers it must be culled out and rejected. 


Quality Audits by engineering of finished products in final ship- 
ping must be contimous. Changes made in design and processes must be 
evaluated in test programs. One aircraft equipment mamfacturer dis- 
covered to his horror that 100% of his fuel gage equipment began fail- 
ing. Original models had passed all airborne and environmental testing. 
But over an eighteen month period a cumlative effect of many minor 
changes had taken their toll on the equipment's ability to pass the 
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specifications. Each small change had been checked, but few tested. 
Now this manufacturer has a Tight Quality Audit System on shipped goods 
stock. 


Analysis of Where on production reliability shows that the engineer- 
ing and production team must first have a reliable design thoroughly 
tested and evaluated with continuous evaluation of production units. 


How 


The How of production reliability must again cover more than just 
the manufacturing areas. 


The first and most important part of production reliability is 
education. As has been repeated before in this paper, everyone mst be 
quality and reliability conscious. This feeling has to be coupled with 
training programs stressing workmanship. Quality and Reliability meet- 
ings are held periodically like Safety meetings. That is why all levels 
of management must be convinced the reliability can and mst be achieved. 
Other training programs for workers covering elementary QC principles 
and work simplification keep them informed and interested in what's going 
on. The use of production handbooks similar to the Western Electric type 
assist in new worker training and in getting consistent performance of 
many assembly process hand techniques. 


Design analysis for producibility by production and tool engineers 
is necessary early in the design phases. Ease of manufacture and cost 
reduction can be accomplished if there is good relations in the engi- 
neering team. Each must respect the other and realize that both produc- 
tion and design engineers are necessary. There statistical quality 
control techniques are useful when applied to models or short runs. 


The processing must be carefully considered to avoid damage or 
undue stress to components. The processes must be tested and analyzed 
from many angles. Tooling and inspection test equipment needs detailed 
analysis. There again statistical quality control techniques are of 
great assistance in model and trial run analysis. The use of production 
environmental testing is controversial. Care must be exercised else 
more troubles will be added into a product than errors called out. Still 
the best test for a cold solder joint in my opinion is a nice cold bath 
in air in a freezer at -50°C, then bring to room temperature abruptly. 
Excessive testing may shorten the life of the product. 


Close liaison must be maintained between component engineers, pur- 
chasing, production engineering, inspection, QC and production control 
to see that component problems with vendors do not get out of hand. At- 
titudes towards specifications, component testing at vendor and home 
plants must change. Vendor education is a mst. Materials handling of 
components, both shippers and in plant, mst be under careful control. 
In missle work 63% of failures have been due to component failure. 


Subcontractor problems are similar to that of component vendors 
complicated by the fact of performance specifications of assemblies. It 
must be emphasized that a subcontractor is an extension of your ow 
plant. You cannot accept conditions in his plant that you wouldn't ac- 
cept in your own. Again education is necessary. In this respect pro- 
duction engineering mst have firm control of subcontractors and purchas- 
ing and production control. 
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The manufacture of piece parts is fairly straight forward, but it 
too can have pitfalls. Fortunately experience to date has shown that 
the reliability of structures is good. There have been no missle fail- 
ures due to air frame failure for example. 


The Assembly areas offer fertile fields for problems to occur. 
Here supervision and training are the keys. Union relations mst be 
considered and can very often be extremely helpful if approached proper- 
ly. The functions of inspection and verification mst be carefully 
understood and delineated. Inspection is performed at definite points 
with written procedures and accepted equipment. Verification is per- 
formed by operators on their own or previous work, where errors are dis- 
covered as operations are being performed. The unit is rejected in 
either case as errors are discovered. Both types of rejects are analyzed 
by QC. The emphasis is returned again on education and training of 
supervision and working force. A foreman told me once that he was happy 
with one bad solder joint in 5,000 joints. I pointed out that this was 
a p overall of about 60% and wasn't acceptable. A rate of one bad in 
100,000 or better should be his goal. He left with his head down like 
a bulldog. Some surprising things can turn up in production evaluations 
when looking at it from new aspects. 


In the QC and Inspection area the usual techniques of quality con- 
trol are not completely adequate. Failure analysis by engineers of 
production rejects, test failures and field failures is also vitally 
necessary. An example of failure analysis is that of printed circuit 
chassis. By using punched card systems of lot identity of materials, 
components and assemblies it is possible to examine the history of any 
one printed circuit assembly. Does it have an excessive number of 
rejects or replacements, do the lots from which the components come have 
poor reject rates, were there process troubles during the assembly of 
the unit? Thus it is possible to look at a finished assembly that has 
passed inspection yet say that due to various conditions of its com- 
ponents cr processes that it might have a high probability of failure, 
therefore reject it. The Quality Audit as described before is another 
link in the chain of reliability. 


The lot identity program as described above as handled by the mater- 
ial analysts is a bit complicated. The advantages in the initial pro- 
duction runs far outweigh disadvantages. The knowledge gained is 
extensive. These mst be handled with common sense. They can include 
raw materials, components and processes. A very complete picture can 
result. 


The last item for discussion is costs. The initial costs are high 
and they are difficult to justify now on equipment which has no mainte- 
nance or failure history. But the attempt mst be made by management to 
budget reliability costs into a program. They then can be handled like 
any other item of cost. It has been discovered that in production 
other economies result from reliability work. Normal results are low 
reject and rework rates in production. Component sorting by semi auto- 
matic means has also resulted in lower reject and rework rates. Dollars 
can very often be hooked to this sort of savings. A final remark can be 
made concerning costs, you can compare the cost of reliability with the 
high cost of failure. 


In summation it can be said that for military requirements and 
commercial customers too that you mst become reliability conscious. 
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You might as well get on the band wagon but with some common sense. 
Reliability isn't too rough to get into if you aren't too hidebound to 
begin with. By using ingemity, common sense and salesmanship highest 
reliability can be achieved in production - it can be maintained - and 
it can be done at a reasonable cost. 
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ROLE OF RESEARCH IN QUALITY IMPROVEMENT 


Richard D. Wells 
Assistant Director 
Fabric Research Laboratories, Inc. 


My comments to this group today will be made with the assumption 
that all of us here accept both Research and Quality Control as vital 
factors of business success. However, such an assumption would not be 
as certain with all textile audiences. There is still a strong senti- 
ment, in this primarily production- and price-oriented industry, that 
product “qualities” have only secondary significance. While there are 
notable exceptions to such a sweeping generality, it is a fair asser- 
tion that in the textile industry at large, research and quality control 
are still classed as "supplemental services", and have not assumed full 
stature as contributors to corporate policy and strategy. 


This statement is not made from a sense of frustration, nor is it 
made in a derogatory sense. My point is that perhaps neither executive 
management, nor ourselves, have considered research and quality func- 
tions in the full potential frame of reference. It is my purpose to 
suggest some of the opportunities to extend the roles of research, and 
of quality control, to good advantage. These opportunities lie, for 
the most part, in the closer collaboration of the two functions. 


In contrasting research with the usual limited role of quality 
control, one can say that in the first is the science of what might 
be, while the latter is the art of tolerable operations with what now 
is. Research is optimistic in the sense that it always assumes better 
things are possible. Quality control is largely premised on the belief 
that perfection is improbable. Both attitudes have their place, but 
must be kept in balance. A philosophical comment might be that close 
collaboration would have value, if for nothing else, in tempering re- 
search with a bit of practicse!‘ty, and in prodding quality control to 
a bit more dissatisfaction with the status quo. 


It has been helpful to me in organizing thoughts on this subject 
to refer to basic definitions. We are, I am afraid, rather prone to 
sloppy semantics, being so accustomed to narrow or specialized meanings 
that we tend to overlook the implication of the terms we use. As has 
been pointed out by more able scholars than I, human thinking processes 
are bound entirely by words and meanings. Corporate thinking is nowise 
different. 


We see so-called "Research Departments" which, whatever their value 
may be in other services, make no pretense of research other than in 
name. There are capable Quality Control Departments, perhaps fulfilling 
a real need, but which may have very little to do with "quality", and 
have no semblance of "control". The error is to assume, since the terms 
appear on the organization chart, that the research and quality control 
needs of the company are covered. This may be true in part, or not at 
all. Very seldom are these functions thought through and defined in 
specific terms in the same sense as are the other business functions 
such as production, sales, engineering, or accounting. At this time 
when so much soul searching is going on to seek causes for the ills of 
the textile industry, it may not be out of order to suggest that inade- 
quate understanding of words may be a contributing factor. 
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I have confidence, at least that proper terminology will clarify issues 
and aid the diagnosis. 


Following Webster, we shall consider research as "careful search or 
inquiry, usually critical and exhaustive investigation or experimenta- 
tion, having for its aim the revision of accepted conclusions in the 
light of new discovered facts". The "conclusion" of the research pro- 
cess is not necessarily an abstract theory, nor, in the other extreme, 
a new product idea. It may be a very down-to-earth conclusion, as for 
instance that a diligently sought after improvement in a product or 
process may be inherently impossible, or actually meaningless, or per- 
haps even undesirable. It might be that in a certain quality control 
problem the variance at A is positively related and calculable from 
observed variance at B, and hence requires no separate testing. It 
might be, from an operations research study, that on a long term basis 
the quality loss penalty of a proposed new system, work assignment, or 
cleaning and maintenance schedule, would more than offset all positive 
advantages. No matter under what title a person may draw hi's pay, he 
is conducting research if he is deliberately seeking facts on which to 
base logical revised conclusions. The "revision" may be something new, 
something changed, or the confirmation or denial of a hunch or rule-of- 
thumb. 


Examining the definition of "quality", we find three distinct mean- 
ings pertinent to our subject. Quality can refer to any characteristic, 
good or bad, intrinsic or incidental, as a describable and perhaps meas- 
urable property. We can speak, for instance, of typical "textile quali- 
ties", as distinct from paper or plastic film qualities. These are the 
characteristics that make something what it is, that make it desirable 
for some functions, and perhaps undesirable for others. In a sense, 
this meaning can be taken as the "qualitative” sense of quality. 


The second meaning, one might say the "quantitative" sense of 
quality, relates to grade or relative degree. It represents a ranking 
on some scale of values. We speak of high quality, low quality, or 
below standard quality, always assuming, of course, that the one to 
whom we are speaking knows and accepts our frame of reference. 


The third meaning of "quality" connotes general excellence. In 
this sense "quality" is a rather snobbish term, and those who use it 
generally avoid specific reference to individual attributes supposedly 
contributing to that excellence. Perhaps this meaning can be better 
understood as the antithesis of shoddy, skimpy, uneven, or unreliable. 
But even that is not valid in all cases. A "quality" tweed to many 
minds is the crudest, harshest, and most amateurish looking one that 
can be found. In women's hose, it isn't a sign of "quality" to last 
more than a few days of wearing. But whatever the basis may be, tech- 
nical or entirely subjective, quality excellence is primarily an in- 
pression in someone's mind gained at least in part from the assumed 
attributes of the item. This sort of observation may seem rather re- 
mote from statistical quality control, or technical quality improve- 
ment, but it is worth a pause to consider that a narrow CV chart never 
sold a yard of goods except as somebody had been conditioned to con- 
sider that a sign of quality excellence. 


The word "control" is another term that has in technical usage 
taken on an unduly restricted meaning. There are actually two quite 
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distinct implications. We commonly think of control only in the regu- 
lative sense of checking, adjusting, and classifying. A main purpose 
of these remarks is to focus attention on the other implication of 
"control", namely, the directive or guiding role. As useful and neces- 
sary as the regulative function may be, perhaps the biggest opportunity 
for benefits from quality control is in the directive role. In essence, 
that means decision making responsibility for the pattern to be followed 
rather than merely the policing of the operations to assure conformance 
to the established pattern. The term “Quality Improvement" as used in 
the title of this paper refers to one phase of this directive control. 
While on the subject of definitions, however, it should be noted that 
"improvement" has a meaning, "to turn to good account", (i.e., improving 
one's time) which is fully as appropriate to our general topic as its 
common connotation of enhancement, or increase. As will be discussed 
later, improvement may at times be in the lowering of a standard, if, 
considering all other conflicting factors, such reduced quality is more 
appropriate for the need. 


Taking the several meanings of "quality" and "control", in their 
possible combinations, we find a much more comprehensive view of quality 
control than that usually envisioned. Though I would not maintain that 
operationally these are six wholly separate functions, the following 
table is a useful reminder of objectives and possibilities: 





Control 
Directive Regulative 
Character Design System 
Quality Grade Standards Operations 
Excellence Policy Promotion 


The terms inserted in the boxes are suggestive rather than defini- 
tive, and the balance of my remarks will be directed to the exploration 
of these six terms. 


Design - Directive Control of Quality Character 


Design involves the determination of what properties the product is 
supposed to have to make it useful and desirable. Quality design is just 
as important to raw materials and intermediate products as it is to the 
final items of commerce and use. For instance, the characteristics of a 
picker lap have no significance whatever except as they render it suit- 
able or unsuitable as a feed for a card which in turn delivers a product 
with certain requirements determined by the next operation. This may 
seem very obvious, and yet I think there is a growing tendency in quality 
control work to look backward instead of forward, to measure and judge 
qualities with reference to the capabilities and variations of the pro- 
cess from which they are derived, rather than by relation to character- 
istics of significance in the end use. In the case of the picker lap 
just mentioned, we customarily note variations in weight which are to 
some extent controllable, and devise a system of checks and adjustments 
to keep the average weights to a prescribed level and the variations in 


a workable range. Yet, a deeper investigation might show that the state 
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of fiber orientation and entanglement, or of fiber strain and rupture, 

also materially affect the suitability of the lap on a feed for the card. 
Unless one knows for a fact that these other "qualities" are either con- 

stant or insignificant, it is rather naive to assume that if the weights i 
are satisfactory all is well. Some of these other qualities may be ob- 

scure and difficult to characterize except by refined research tech- 

niques. But it would be rather surprising if, in the last analysis, it 

was found all that counted in these textile intermediates was weight. 


As another example, consider the quality design of a fabric for a 
certain industrial application where high strength is a requirement. 
That would seem to be a straightforward proposition. Yet what do we 
really mean by "high strength", in any particular case? Is it average 
strength in one-time loading, is there a critical spacing of strength- 
giving components, is it retention of strength in repeated loading or 
flexing, is it a unidirectional or a multidirectional requirement, or 
is it only ability to resist distortion under loads much less than ulti- 
mate strength? It has not been customary to consider that such factors 
are in the province of quality control. The usual procedure is to take 
some sample that has been "accepted" and control the production system 
to duplicate it within reasonable tolerance according to obvious and 
conveniently measured characteristics. The characteristics being dupli- 
cated, however, may be only incidental and quite insignificant in terms 
of true requirements. It is not at all uncommon, particularly in indus- 
trial application, to find two items which conform equally in normal 
specifications, but one proved superior to the other in service. There 
is no mysterious anomaly in this, but only failure to properly determine 
quality character and direct the design specifically to have that char- | 
acter. It might very well be, as perhaps in a fabric for reinforcing 
diaphragms, that a rather low average strength could be tolerated as 
long as there were no localized areas of weakness. It should certainly 
seem important for the quality controller to know that requirement so 
that the production and control system be set to eliminate the low ex- 
tremes even if the averages or the statistical variability has to be 
sacrificed to some extent in achieving that objective. 


I can think of an instance of a company pioneering in a new type of 
fabric for an important industrial use. This involved novel manufac- 
turing techniques which were difficult to master. Early production was 
accepted only under the premise that subsequent experience would correct 
a certain apparent "fault" still under criticism. As it happened, a 
potential competitor got wind of the situation, devised a system which 
eliminated the fault, and seemed set to scoop the field while the ori- 
ginal producer was still struggling. It was a technical triumph, except 
for the fact that his improved material didn't work at all in this appli- 
cation. Later analysis showed that the "fault" was the key character- 
istic for success in this critical use. 


Character design is also important even with standard staples of 
presumably non-critical materials. As a simple and yet dramatic case, 
there was a mill of diminishing fortunes suddenly inspired to out- 
modernize its competition and win its way back with a premium grade of 
goods. This objective required a more uniform yarn than was customary 
to this trade, and the new equipment and systems were chosen, after quite 
extensive study and evaluation, with that as the primary criterion. Ini- 
tial samples were beautiful. However, the desired characteristics had 
not been fully and properly defined. This was a fabric of contrasting 
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warp and filling color. Tne new yarn was so perfect and uniform within 
the bobbins that the very slight bobbin-to-bobbin variations caused 
shade bands of surprising contrast. The distribution of this irredu- 
cible variation within the total system was such, since sorting and 
channeling were unfeasible, that the mill was turning out many cuts of 
goods of truly exceptional quality, and almost as many which it would 
be only charity to classify as commercial seconds. There was nothing 
in between. A certain amount of eye-distracting short-term variability 
would have been far preferable to the perfection that had been achieved 
at great expense and effort. The same, or even an appreciably greater 
degree of bobbin to bobbin variation would have been indistinguishable 
if the individual bobbins had been less uniform and solid in appearance. 
A critical determination of what was important in this type of goods, 
in other words, a definition of desired quality character, would have 
prevented this fiasco. The advance evaluation would have then been by 
reference to the specific requirements rather than to what the ordinary 
mill or regulative quality control man would consider general criteria 
for good uniform yarn. 


It is obvious, I think, that this Design phase of Quality Control 
calls for research attitude and research methodology. Facts must be dug 
out by analysis and critical experimentation as basis for sound conclu- 
sions as to what qualities should be controlled. Assumptions or over- 
sight may be dangerous. 


System - Regulation Control of Quality Character 


Once the desired characteristics are determined it is essential to 
devise a system by which they may be checked. That is so logical a step 
that it would seem to require little mention. Unfortunately, it seems 
so patently simple that it sometimes gets no attention at all. The two 
rules for proper regulation of quality character are actually versions 
of the same principle. The one is not to confuse the index with the 
property, and the other, not to assume an intrinsic correlation of two 
properties unless such is proved certain. 


Take, for instance, the matter of uniformity of yarn as a charac- 
teristic. We have mentioned previously a case where the wrong index was 
used insofar as short, medium, and batch uniformities were concerned. 

But take another look at the same problem and we see further that in this 
case, as in most other textile cases, it is bulk uniformity and not weight 
uniformity that is the real characteristic to be achieved and controlled. 
Yet most of our uniformity measuring devices are based on weight, or a 
direct index of weight such as capacitance. Even those devices which do 
sense bulk do so under pressure and confinement designed to minimize 
density of bulk effects, and thus presumably indicate weight. The cor- 
relation between weight and bulk may be good enough for some purposes 
and not for others. I recall the withering scorn addressed to a quality 
control spokesman defending his product in an argument, which must be 
paraphrased for public utterance as, "Young man, I am not buying your 
control charts. My eyes tell me the yarn is lumpy, and you can keep it". 
We have, on the other extreme, the hard-shell who has no faith in any 
instrument determination and trusts only his eye. Some of these gentle- 
men have in fact very discerning judgment and on the seriplane boards 
can consistently detect differences too subtle for present instruments. 
Yet they can be equally wrong, for the final criterion of uniformity is 
that of the effective bulk under the constraints of the woven or knit 
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structure, which are quite different from the conditions on the viewing 


boards. Some types of yarn variability are almost canceled out as far 
as visible effects are concerned under the constraints of the final 
fabric, while other types of variability where stiffness is a correla- 
tive factor are augmented in apparent effect due to the influence on 
crimp balance. This subject of yarn uniformity is highly complex as 
you well know, and I do not intend to launch into a thesis but only to 
raise the question as to whether some of our ingenious and statisti- 
cally sound quality control systems are based on the factors we should 
be measuring and controlling. 


Another case, perhaps more remote from your field but worth men- 
tioning as an extreme case, is that of a cotton mill which ran into 
great difficulty with cotton from a new year's crop, in that it gummed 
up and lapped the draw rolls. It was a sticky problem, and no joke, 
for downtime was mounting to disastrous proportions. The troublesome 
substance seemed sugary in nature, as perhaps from incomplete synthesis 
of cellulose due to peculiar weather in that crop year. With that clue 
a system was set up to check each incoming bale by a reducing-power 
test for sugars. The "sweet" bales were flagged and proportioned into 
the laydown mixes in low percentages, and the problem was brought quickly 
under control. The daily sugar tests were checked as carefully as if 
the mill were a diabetic clinic. This quality control operation was 
followed through the next crop season and the next with great apparent 
success, for though sweet bales were still showing up, no trouble was 
encountered following this system. However, for some reason the labo- 
ratory once fell behind on the tests and for several days mixes were 
made up without regard to sweetness. When the reports finally came 
through it was realized that several of the mixes had been "loaded" 
with sweet bales. The mill didn't know the difference. There was no 
sticking. Here was a case of double confusion, for a reducing test is 
at best only a general index of sugar content, and sugars were only in 
part an index of stickiness. What had proved a highly valuable measure 
under a certain set of conditions was inappropriate and misleading 
under other circumstances. Correlation had been assumed, but not 
proven, and the quality control system adopted without adequate research 
to establish valid premises for the test procedures to insure their 
pertinence to the desired characteristics. 


Standards - Directive Control of Quality Grade 


This function begips to get a little closer to what is ordinarily 
considered "quality control". But there are two research steps in this 
that are quite frequently missed. 


In discussing the matter of Design, we were in a sense working 
toward what might be called the "ideal specification" of qualities we 
would like to attain. The first phase in establishing standards is to 
set this ideal specification out in balance and in quantitative terms. 
In most frequent cases there will be found conflicts based on funda- 
mental factors. For instance, a requirement for thickness might have 
to be reconciled with that for low weight, porosity, strength, or drape. 
Based on relative values of the several requirements, an ideal target 
is set. Upper and lower values should be established, where possible, 
indicating limits beyond which that characteristic is either unaccep- 
table on the one hand, or is in a meaningless range on the other. One 
might, for instance, determine that below a minimum of 20 pounds strength 
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the items would fail to function at all, and above 30 pounds no benefit 
would accrue from the excess. In the aforementioned case of the con- 
trasting color fabric, it could have perhaps been determined that a CV 
for a certain gage length of 15% would disqualify the merchandise, but 
any improvement below 5% would be undetectable in that particular fabric. 


The next step is the determination of the potential specification 
based on the inherent capabilities of the materials and processes with 
which we must work. It is a "gaximum" standard in the sense that it is 
the best we can hope to do if everything goes as perfectly as possible. 
Failure to research these limits has caused untold grief in textiles 
and I imagine in other industries as well. Unless the potential maximum 
specification falls safely and comfortably in the ranges of the ideal 
specification, the mill would be well advised to turn its attention to 
other affairs. All too often a commitment is made without regard to the 
potential specifications, in some vague hope that the production and 
quality control crowd can somehow achieve the ideal specification. 


An excellent example of this point is in the blending of two fibers. 
Suppose a 50/50 blend is the desired norm. An experimental checkup might 
show for the ideal specification, that variability (expressed as the CV 
of the ratio of the percentage of the two fibers in a given gage length) 
in excess of 10% would be unacceptable in view of shade, luster, napping, 
or other requirements. It might also show that below 5% there were no 
discernible differences. 


The potential specification might show that in the production sys- 
tem to be followed there were inherent random elements which precluded 
the chance of operating within the ideal specifications no matter what 
control was exercised, or that the potential was so close to the ideal 
specification tolerance that the additional variation of non-inherent 
factors would throw much of the product beyond the acceptable limit. 

For instance, in a rather intensive statistical research study on two- 
component blending made a year or two ago it was shown that for the 
chosen fibers in their lengths and deniers a CV for blend ratio of below 
8% for a certain gage length was a theoretical improbability. As it 
happened in this case this limit held for both hopper and draw blending. 
If the ideal specification called for a 10% maximum, it meant that 
things were too close for comfort. 


These two theoretical specifications, the ideal and the potential, 
may at first seem only abstract concepts of little value. I would cer- 
tainly grant that since this is the textile business we are talking 
about, and not guided missiles, there is seldom time or funds available 
to do all the research necessary to establish complete and precise speci- 
fications of this sort. However, the key characteristic generally can 
be singled out and determined to a useful approximation. The value lies 
in putting the final operating quality control system in proper per- 
spective. If, for instance, it is known that even the best one can do 
in achieving a certain characteristic is close to the ideal tolerance 
limit, the working tolerances should be adjusted accordingly and cor= 
rective attention concentrated on factors influencing that character- 
istic. If, on the other hand, the quality control staff knows only the 
intended norm, the usual tendency is to set tolerances solely on statis- 
tical considerations of the product variables actually observed rather 
than on variance from limits inherent to the system. The uses of ths 
potential specification or standard are in two directions. On the one 
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hand, it avoids plans and commitments which are impossible of attain- 
ment. On the other, it gives us a much more realistic picture of the 
efficiency of our controls, as in the blending problem for instance. 
By ordinary comparisons an observed CV of 10% might be considered only 
slightly less uniform than a CV of 9%. With reference to a potential 
standard of 8%, however, it is recognized that the 10% is twice as far 
off the beam, or out of control, as the 9%. The main purpose of the 
potential standard is to distinguish the controllable from the uncon- 
trollable, and furnish a meaningful base against which to measure pro- 
duct conformance and in turn, the efficacy of our controls. 


The working specification is the one with which we are most fami- 
liar. It is based largely on previous or current experience. The 
danger is that if the ideal and the potential specification are not 
kept in mind the working specification becomes ‘something sacred and 
immutable. The conditions which dictated the compromises of the work- 
ing specification, at the time it was drawn up, may change, perhaps by 
an incidental changs somewhere along the line, perhaps by deliberate 
corrective action. This may be from a basic modification of the system 
which in effect alters the potential specification and gives opportunity 
to improve the working specification even without any better “control”. 
It may be by a new development allowing cluser control of the control- 
lable variables, with no change in the ideal specification itself. The 
guiding rule is to consider every working standard as tentative and 
subject to revision as research points the way to improvement. 


Qperation - Regulative Control of Quality Grade 


This is the field you know best and I shall not attempt to elabo- 
rate. Recent progress in statistical concepts and methodology has been 
amazingly rapid, and certainly the ASQC deserves great credit for the 
stimulation and education it has provided. My only comment is to cau- 
tion that we do not get so wrapped up in our mathematical methods that 
we get like the statistician's son, in a natural science class then 
studying the nesting habits of birds, who maintained that the domestic 
hen was unique in laying her eggs on an average. It if not unknown in 
the history of quality control that other eggs have been laid on an 
average! 


Policy - Directive Control of Quality Excellence 
Promotion - Regulative Control of Quality Excellence 


To save time I shall discuss these two together. They may seem 
rather nebulous at first. Yet except for that part of quality control 
necessary solely to keep processes operable and efficient, all other 
quality control matters should be within the framework of a definite 
quality policy. Quality excellence is a product feature in the same 
sense as price, style, and technical suitability. It is one of the 
factors which must be weighed in programming both production and sales. 
It may be of key importance for some companies, for some products, for 
some markets. Product excellence may amount to very little under other 
circumstances where competing factors, such as price, dominate. 


A research analysis of a company, product, and market situation 


might, for instance, lead to the conclusion that the corporate fortunes 
would be favored by orienting the whole operation to a price game, 
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cutting every corner to get products out at so low a cost that no one 
would expect much in the way of quality excellence. In another situa- 
tion the odds would favor major attention to style, variety, and versa- 
tility at whatever quality and price might result. Ina third case, 
success might be predicated on outstanding product excellence that would 
gain a premium price and insure customer loyalty. It should not be as- 
sumed that excellence is a virtue that brings its own reward if to at~ 
tain it one is at a disadvantage on other factors. These are the con- 
siderations referred to in speaking of quality policy. 


Analysis of business failures in this industry show many examples 
of companies which made a fetish of product excellence, out of tradi- 
tion or in faith in the better mousetrap theory, but couldn't survive 
in a market where success went to those playing the price or novelty 
game. A list of such failures also includes many which had no real 
opportunity to compete on a cost or versatility basis, and failed to 
seize an available opportunity to build a position on product exce lence. 


It is my contention that responsibility for determining an appro- 
priate quality excellence policy is a proper function of quality control, 
in the expanded role I am suggesting. The most skillful quality control 
system in the business could gain the company little or in fact be self 
defeating if the assumed target is improper for the circumstances. Such 
policy itself should be under continued review,for in this day and age 
things change rapidly. 


I am reminded of a greige mill that found itself in some difficulty 
on general quality of an apparel fabric very important to their program. 
They had followed the usual tradition of grading similar goods, and had 
failed to reckon with changed conditions, in that fashion was calling 
for very revealing plain shades and that resin treatments were being 
applied. With the combination of these two factors, minor irregulari- 
ties and details that were not previously even noted became glaringly 
accentuated. After a first reaction that the market was unreasonable 
to expect the demanded perfection, the mill finally came around and did 
one of the most intensive quality improvement jobs I have ever known. 

It was a classic in what a rigorous quality control program can accom 
plish, though, of course, it took time. Finally, after this campaign, 
the product was admitted by all as the number one make and in a class 
by itself. The mill now experienced no complaints, but also, unfor= 
tunately, almost no orders. 


In a management meeting to try to figure out this dilemma, a 
rather blunt salesman shocked the group by asserting that the only 
trouble with this mill was that it didn't make enough seconds. He 
was not thanked for his opinion at the time, but later analysis showed 
that this mill's seconds and competitors' goods that were no better, 
were taking over the current market at the prices at which they could 
be sold. Fashion now called for bizarre and eye-catching prints, and 
flowing and heavily detailed garment design, which compietely obscured 
the types of irregularities and defects in the greige goods that the 
mill had labored so hard and expensively to overcome. Why pay for a 
quality excellence that no one would see? 


By hindsight, the lessons in this tale are quite apparent. The 


real point of the story is that there were ample facts, available 
sufficiently in advance of the two predicaments in which this mill 
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found itself, which would have guided quality policy so as to capitalize 
on, rather than suffer from, both situations. Quality control must, if 
it is to fulfill its full role, keep one eye on the future as well on the 
problems of the day. Part of this assignment is in a specialized type 

of market research which is only beginning to be recognized, which deter- 
mines what part product quality will play in winning a favorable position. 
Part is in internal research to assess quality capabilities of the plant 
and its organization. The summation is a type of operations research 
calculation in which all factors, of which quality is only one, are 
balanced out for optimum benefit. 


Once policy is set and provision made to keep it current, there is 
the job of implementing it along previously mentioned lines. In addi- 
tion, there is the continuous watchdog responsibility to see that opera- 
tional decisions are consistent with the policy. Aggressively, this 
quality promotion is in pointing out opportunities in equipment, work 
loads, supervision, training, maintenance, etc., which would favor attain- 
ment of quality goals. Defensively, it is in seeing to it that decisions 
made primarily for cost or other reasons do not prejudice ability to 
achieve desired quality. All too often, the quality control staff has 
to struggle with a situation it had no part in creating. 


peaere. tae 


These comments hit only a few high spots of what is a complex sub- 
ject. My intent is only to suggest that perhaps the methodology of 
quality control has advanced further than skills in applying quality 
control for maximum benefit. In the textile business we have not only 
to contend with rough technical control problems due to the nature of 
our materials and processes, but also have the handicap that typical 
textile organizations have yet much to learn in coordinating t‘eir 
staff functions to full advantage. There are opportunities within 
the broad meaning of "Quality Control", however, to both improve 
quality in a relative sense, and to improve the use of quality charac- 
ter and quality excellence as potent features of corporate programming. 
This expanded role of quality control endeavor must draw heavily on 
the attitudes and techniques which by definition are true research 
principles. There is reason to believe that some of our routine 
quality control procedures will prove unnecessary if we can probe deeper 
into our problems. Let us at least be sure we do not lost sight of 
"quality" by over-obsession with quantitative factors. 





ADMINISTRATIVE APPLICATIONS: SOME COMMENTS AND SUGGESTIONS*#® 


Lloyd A. Knowler, Chairman 
Department of Mathematics and Astronomy 
State University of Iowa 


The primary purpose of this report is to call atten- 
tion to some of the elementary principles of Quality Con- 
trol by Statistical Methods or Operations Research which 
are being applied to administrative procedures and possibly 
to suggest some additional approaches. 


In administration, as in manufacturing or any other 
division of an organization, management is concerned with 
the production of a product or service at the least cost 
per usable item. Consequently it is necessary toO evaluate 

@ work in each division in relation to the overall opera- 
tions and to use such methods and techniques as wili con- 
tribute to the objectives of the organization. In so 
doing it is desirable to have the best arrangement of 
functions and duties involved in the operations. To con- 
tinue toward the achievement of this objective will pro- 
bably require at least a periodic study of existing condi- 
tions and possibly a gradual change of operations or a 
shift of responsibilities. A quality control engineer 
is particularly trained to make such studies and he mst 
be used in this capacity if he is to make his greatest 
contribution to the organization. 


A specific request was made to include "clerical 
application of statistical methods" in this report--one 
of the divisions referred to above. 


A brief survey of the work in any office indicates 
the presence of a great deal of administrative detail 
including clerical work. Undoubtedly most, if not all, 
of the detail is, or has been, desirable. Even though 
some of the chores could be eliminated, there is often 
some hesitancy if not reluctance to dispense with then. 

It is observed that the installation and improvement of 
mechanical apparatus eliminates many of detailed operations 
and makes work much more pleasant. However, because of 

the additional information which can de supplied by the 
apparatus and because of the additional demands made upon 
it we find that a comparable elimination of administrative 
detail is not always evident. In fact, the opposite 

often prevails. 





# Summary of a paper presented, by invitation, at the 
Twelfth Annual Convention, American Society for Quality 
Control, Administrative Applications Division, Boston, 
Massachusetts, May 26-28, 1958. 
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One of the things which would concern the quality 
control engineer would be an evaluation of the needs for 
the various operations being performed or being contem- 
plated. Naturally such evaluation would be made in con- 
sultation with management. One executive remarked that 
his organization spent several thousand dollars to lear 
how to do one phase of their work rapidly and then they 
discovered that they could abolish the whole operation 
with the use of a quality control chart. 


Among the specific examples discussed were the 
following: 


1. The application of statistical methods to the work 
of a pool of typists whose work is somewhat varied 
but at the same time somewhat routine. The items of 
error hazards, costs and production were discussed 
in connection with the use of control charts; 


2. Extensions of the principles illustrated in example 
1 were made to other situations such as transporta- 
tion, clerks, sales and printing; 


3-e The application of statistical methods to a small 
operation involving three or four persons or even 
one person; and 


4. The application of statistical methods to duplicate 
readings such as laboratory tests. 
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PROCEDURES AND TABLES BASED ON 
ACCEPTABLE RELIABILITY LEVELS 


D. A. Hill and R. H. Myers 
Hughes Aircraft Company 


Abstract 


Quantitative tests for reliability of equipment were started in 
1955. The method was developed into procedures and tables applicable 
to many types of parts and equipments and over a wide range of Accept- 
able Reliability Levels. The paper includes experience in applying this 
approach to major supplier black boxes of complex systems. 


Introduction 


Electronics has come into its own in the period since World War 
Il. Every type of offensive or defensive tactic depends on black boxes, 
whether they chart a path in space for missiles, provide eyes and ears 
for submarines, or link the operations of widely scattered ground 
forces. Industry has turned to electronics to handle its great quantities 
of data and increasingly to control its processes and machines. The six 
tube receiver has become a two to twenty thousand tube "brain" with 
capacity for memory, calculation and programmed decisions. Thise- 
normous complexity, together with the unfriendly environments in which 
delicate equipment must live, brought about the problem of reliability. 
The question affects other types of equipment, but electronics has had 
the most difficult job. The effort to find solutions has concentrated in 
this field. 


When man seeks control over a problem, he looks for something 
he can measure. It is the essence of quality control that the indefinite 
characteristics of products are replaced with quantitative measures. We 
want to make predictions, watch trends and evaluate the result of 
changes. Reliability presents a different problem from other equip- 
ment parameters. It necessarily includes an added dimension -- time. 
This is generally taken for granted when we consider length, weight, 
voltage output or chemical content. When we test for reliability, we 
may sample a percent of available equipments but we are also sampling 
time. In addition to our interest in reliability trends and control, we 
are also trying to make a reasonable prediction of what will happendur- 
ing subsequent field use. 


Objectives 


Important techniques(!): (2), (3) contributing to designing accept- 
able reliability levels into complex electronic systems have been for- 
malized and are being applied by a few companies. Regardless of how 
valuable these design procedures become, reliability testing will still 
be mandatory for purposes of (1) detecting and correcting unforeseeable 
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part, unit, or system modes of failure, (2) verifying and revising parts 
failure rates used to estimate reliability from the drawing board, (3) 
determining hardware conformance to specified reliability levels, and 
(4) monitoring reliability trends and evaluating the results of changes. 


It is the latter two areas with which this paper concerns itself; 
presented are technical, economic, and statistical aspects and experi- 
ence relative to quantitative reliability testing procedures. 


For tests on manufactured equipment, the objective was to de- 
velop a technique that 
—is practical to run in a manufacturing operation 
—will provide a quantitative measure of equipment reliability 
before shipment 
—can be correlated with performance results in the field 
—helps to highlight the problems that need action. 


With the establishment of overall requirements on the systems 
being supplied to the services, it became important that methods be de- 
veloped for establishing reliability control at system, black box and 
part levels. 


The test methods described in this paper have been applied 
principally to major systems, such as weapons control, navigation and 
communication equipment, and to the subsystems and black boxes that 
go into them. The detailed parts like transformers, relays and gyro- 
scopes have generally been covered by the application of environmental 
and life test requirements. 


In line with the above objectives, a tabulation of reliability test- 
ing procedures and tables were developed. They comprise detailed in- 
structions and standardized tables from which time sampling plans may 
be selected and incorporated into reliability testing specifications. 
Specific development and application considerations related to Accept- 
able Reliability Level techniques are described in succeeding sections 
of the paper. 


Reliability Testing Procedures and Tables 





The time sampling plans derived were mathematically based 
(see the Technical Appendix for related definitions and formulae) on a 
failure distribution per operating time in a chance or random manner 
describable by a Poisson process, i.e,, an essentially constant failure 
rate. Various field studies by ARINC() and Vitro\>) had confirmed 
that the Poisson process describes a large percentage of complex 
equipment failure patterns. Comparable Hughes 6) results based on 
factory test data are shown in Figure 1. Each point represents the av- 
erage number of failures per operating hour of test (for a large group 
of systems) over initial debugging and harmonization, and subsequent 
reliability test periods. 
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The failure rate declines rapidly as debugging, adjustment and 
harmonization proceeds. This is followed by a relatively constant and 
random failure pattern. 


To facilitate application sampling plans were tabulated. As 
shown in Figure 2, all time sampling callouts are in familiar engineer- 
ing terms of operating hours and related failures. The Acceptable Re- 
liability Level, expressed as number of failures per thousand operating 
hours, was termed the ARL. By using the index, failures per thousand 
operating hours, the range covered by the tables can be most conveni- 
ently expressed numerically. Failures per thousand operating hours 
are additive. This permits us to determine system reliability from the 
sum of test results on subsystems (when independence is assumed). The 
reciprocal of ARL, or Mean Time Between Failure, cannot be added. 
In addition most parts failure data is given in terms of failures per 
some base of operating hours, so that studies of the relation between 
parts and systems are facilitated. 


The range of available plans extends from 250 to .001 failures 
per thousand operating hours. 250 failures per thousand operating 
hours (equivalent to a mean time between failure of 4 hours) is a lower 
limit that might be applicable to a very complex system. Conversely, 
.001 failures per thousand hours is an appropriate requirement for a 
single part such as a capacitor. 


As shown in the Figure 3 example, matched sets of single, dou- 
ble and multiple time sampling plans are available which have essenti- 
ally the same risks in their ability to evaluate product conformance to 
the specified reliability level. Choice of which type of reliability test- 
ing to use can, therefore, be based entirely on economic versus admin- 
istrative considerations. For example, single sampling administrative 
advantages can outweigh multiple sampling test time savings in reliabil- 
ity tests of complex systems. 


Provisions for tightened and reduced testing have also been pro- 
vided. Sample Operating Characteristic curves for Normal, Tightened 
and Reduced Time Sampling Plans are outlined in Figure 4. The tight- 
ened reliability criteria (less allowed failures) are instituted to protect 
consumer interests when prior test results reflect an unacceptable re- 
liability level requiring product improvement. Conversely, reductions 
in test times or frequency are permitted (with related decreased costs) 
when acceptable reliability levels have been demonstrated and sus- 
tained. 


Although the procedures were developed primarily for purposes 
of production reliability control, specific time sampling plans (since 
they are independent of the number of items tested) are applicable for 
use by Development groups in qualification type reliability testing 
where few equipments are available. In this Engineering area, use of 
tightened conformance criteria is recommended so as to minimize the 
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chances that product which passes a development reliability test, will 
fail the production counterpart due to sampling fluctuations. 


The time sampling plans and related procedures were derived 
along similar lines to MIL-STD-105A, considered representative of 
conventional inspection sampling methods. The military standard es- 
tablishes a sample size or number of pieces to be drawn from a lot pre- 
sented for acceptance and gives the acceptance number of defectives al- 
lowed inthis sample. The reliability testing tables designate a sample 
of operating hours to be observed on one or more units of product and 
give conformance number of failures. Where MIL-STD-105A refers to 
Acceptable Quality Level (AQL) these tables use the term Acceptable 
Reliability Level (ARL) to denote the minimum acceptable levels of 
quality and reliability respectively. 














The concepts of consumer* and producer* risks are also quite 
Similar. In each case, the larger the sample or test time, the lower 
the risks of considering as acceptable, product of quality or reliability 
poorer than the AQL or ARL. 


The Department of Defense set up an Advisory Group on the Re- 
liability of Electronic Equipment (AGREE). Their report(? includes an 
approach to acceptance tests based on mean time between failure and a 
comparison is of interest. Figure 5 shows ground rules and considera- 
tions relative to each approach. As outlined both are predicated math- 
ematically on a Poisson failure distribution per operating time. The 
AGREE reliability index is in terms of mean time between failure, the 
HAC Tables use the reciprocal term tailures per thousand operating 
hours. HAC procedures provide for matched single, double or multiple 
time sampling plans while AGREE has established a sequential reliabil- 
ity testing procedure. Where AGREE has standardized on various levels 
of test environments depending on operational use of the equipment, HAC 
procedures consider that these will be established by the applicable test 
specification. Where demonstrated nonconformance requires continua- 
tion of, or reversion to 100% reliability testing under AGREE, HAC 
ground rules call for corrective action and/or suspension of shipments 
in addition to instituting a more severe conformance criteria (tightened 
testing). When reliability test results equal or exceed the specified re- 
liability level AGREE procedures require only a sample of equipments 
to be reliability tested; similarly, HAC procedures permit a reduction 
in total test time and/or a reduction in test frequency, e.g., monthly 
to bi-monthly. 


Test risks also differ. Under AGREE ground rules producer 
and consumer risks were fixed at about 10% for an operating ratio* of 
1.5 to 1. Consistent with the above risks, AGREE tests require, if 
equipment is at the specified mean time between failure (MTBF), about 
20 times the specified MTBF to reach a decision. Conversely, HAC 





*Defined in Section 1.0 of the Technical Appendix. 
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producer, consumer risks and operating ratios are variables with re- 
lated test times preselected on the basis of test equipment and manpower 
availability, associated costs and the risks the buyer is willing to live 
with. For example, less complex, more reliable black boxes will re- 
quire a longer test for an adequate measure of their reliability. On the 
other hand, these boxes will be subject to further tests as a part of the 
system. It may make sense to economize at this point and accept a 
larger risk. Subsequent tests will catch up with unfavorable trends. 
The completed system will have a much higher failure rate than any of 
its component items. A relatively shorter time-sample will do the job. 


Application to Reliability Specifications 





With overall requirements established for the systems being 
supplied to the military, it was necessary to pass on to major black box 
suppliers compatible reliability criteria in the form of routine, con- 
sistent and economic test specifications. 


Routine specifications were required so that the inherently 
lengthy cycle of test program development, supplier negotia- 
tion, reliability test, evaluation of results, corrective action, 
retest, etc., would be minimized. 


Consistent specifications were necessary so that the test crite- 
ria by which purchased black boxes were judged, and which con- 
trol test equipment and test operating costs, were equitable, 
economic and specific. In addition, it was advantageous to have 
available consistent material for use as guides to suppliers in 
analogous application to their own operation. 


Economic specifications were needed so that black box reliabil- 
ity levels were evaluated at minimum time and cost through rou- 
tine application of test and sampling procedures. 


First, a standardized military type specification format was es- 
tablished. Called out were the following nine (9) paragraphs. Consid- 
erations appropriate to each are described below. 


..< Purpose 
Contains a brief statement of the specification objectives. For 
example, "To outline the requirements of a reliab‘lity measure- 
ment program designed to determine conformance to the speci- 
fied reliability level." 

2.0 Scope 
Contains a tabulation of black boxes and accessories to which 


the reliability test is collectively applicable. In this area con- 
siderable savings in time and costs are possible through maxi- 
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imum group testing of black boxes by subsystems and suppliers. 
On the other hand, test equipment size or schedule restrictions 
may dictate that a complex system be subdivided into several 
functional reliability test loops. 


Applicable Specifications and Standards 





Appropriate statements as to what specifications or standards 
form a part of the reliability test specification. For example, 
applicable product specification, test procedure, sampling pro- 
cedures and tables, etc. Confusion will result if the buyer and 
supplier do not agree on a complete description of the test con- 
ditions and criteria. 


Acceptable Reliability Level 





Contains a quantitative description of the minimum acceptable re- 
liability level in terms of both failures per thousand operating 
hours and equivalent mean time between failure. Since the speci- 
fied reliability level and related conformance requirements domi- 
nate the reliability test specification, more detailed discussion re- 
garding the development of the reliability requirementis in order. 


Acceptable reliability levels established for black boxes must be 
consistent with overall system requirements. Specifically this 
means all black box failure rates must add to the system failure rate. 
Where black boxes are common to more than one system, it may 
be necessary to set the reliability requirement at a level that will 
meet the most demanding application. Once derived (ona similar 
black box and/or complexity basis) reliability requirements per- 
mit of effective implementation in the following related areas. 


Black box failure rates constitute a logical priority base for gen- 
erating test specifications, on the basis of highest failure rate 
items first. Concentrated effort on the highest failure rate equip- 
ments would seem to be the most rapid and economical means of 
reaching the system requirement. What can be gained by con- 
centration on black box testing instead of waiting to test a com- 
plete system? First, calendar time savings, since reliability 
testing can be instituted and improvements effected much earlier 
in the development, production, field test cycle. Second, it is 
possible to generate a relatively larger number of test operating 
hours resulting in more failures; therefore, related corrective 
action can be instituted earlier. Third, a more intensive relia- 
bility evaluation is usually possible due to more detailed test 
procedures, test instrumentation or test conditions. Also of 
value is that black box requirements when compared with relia- 
bility test results spotlight those units farthest from predicted 
values on which the most fruitful reliability improvement effort 
can be invested. 
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.0 


Definitions 


Contains standard definitions of the primary phrases and state- 
ments found in the various paragraphs; for example, reliability, 
index of product reliability, failure, fault, mean life, acceptable 
reliability level, etc. 


Conditions of Reliability Test 





This paragraph covers the following primary considerations: 
Establishes a standard test operating cycle; 
Establishes the specific test procedure to be used to deter- 
mine malfunctions; 
Establishes the test conditions (i.e., ambient, environ- 
mental or combinations thereof) under which testing is 
conducted. 


As outlined above, the standard test cycle attempts to provide a 
meaningful simulation of actual operating conditions. Specific- 
ally, it establishes when the equipment shall be turned on, exer- 
cised in various tactical modes, checked for malfunctions 
against the test procedure, turned off, etc. 


Repetition of the standard test cycle achieves the total operating 
time required by the applicable time sampling plan callout. This 
makes for maximum flexibility since the time sample can be in- 
creased or decreased, and can use fixed or sequential type sam- 
pling, without affecting the test procedure or standard test cycle 
requirements. 


In addition to the standard test operating cycle described, para- 
graph 6.0 should clearly establish the reliability test procedure 
to be used to determine malfunctions. This should be based on 
an existing test procedure, test specification, product specifica- 
tion, etc. Regardless of its base the reliability test procedure 
should have the following characteristics: be specific; be avail- 
able or readily generated; and (appropriate to black boxes) equal 
or exceed in severity the reliability test requirements the boxes 
must later pass at the systems test level. In this sense, in- 
creased severity can mean more characteristics are checked, 
tighter parameter tolerances are called out, or the application 
of realistic environmental stresses. 


To summarize the approach to test design: the performance test 





procedure (reliability is essentially performance over time), in 


conjunction with a repeated standard test cycle, under ambient 
and/or environmental test conditions provides an economic, 








meaningful and rapidly effective reliability test. 
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7.0 Terms and Conditions 





This paragraph comprises methods of sampling and reporting in- 
puts. Methods of sampling together with the test conditions of 
paragraph 6.0 jointly control test equipment and test operating 
costs. The latter is predicated primarily on engineering con- 
siderations, the former on statistical and probability principles. 


Time ) 


=—_——————) as & 
Failures 


The recognition of mean time between failure ( 


Force 
Area 





statistical variable (like yield strength ( ) but unlike current 


Voltage 


<a ) is important. What this means is that the calcu- 


lated mean time between failure (or failure rate) is subject to 
sampling fluctuations and tends to vary from the "true" value 
more widely if the number of hours used in computing the mean 
time between failure is small. As shown in Figure 6, this vari- 
ability is reflected in Subsystem A experience, where seven (7) 
successive operating hour periods of reliability test indicated 
subsystem mean time between failure of 83, 250, 125, 250, 62, 
200 and 125 hours respectively. In effect, since reliability test 
results inherently vary, it is logical that statistical criteria, 
such as time sampling plans, allowing for such fluctuations, be 
used to determine conformance or nonconformance to the spec- 
ified reliability level. 


Sampling flexibility is also necessary. Procedures should be in- 
dependent of the number of equipments tested, thereby permitting 
use in the critical development phases, when few items are avail- 
able for reliability evaluation. The inevitable question of whether 
to test 4 units for 200 hours each or 8 units for 100 hours also a- 
rises in this area. For all practical purposes, the answer to this 
question would seem to be dictated by test equipment, manpower, 
time, or product availability. For example, development qualifi- 
cation type tests will usually be run on a few items for many 
hours; conversely, production testing will comprise periodic 
testing of a relatively large number of boxes for a relatively 
small number of hours. This inherent development-production 
relationship is not undesirable since it permits early evaluation of 
wear-outtype component failures, whose average times to failure 
extend well out intothe service life of the system. Such failures 
are amenable to scheduled replacement and subsequently consid- 
ered a much less critical reliability area than the random failure. 


The other primary input in the Terms and Conditions paragraph 
concerns the method of reporting and submitting the reliability 
test results. These indicate what reliability test information 
should be submitted, backup data required, frequency of report- 
ing, to whom and by when. 
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8.0 Corrective Action 





Of particular interest to both supplier and buyer is what happens 
when nonconformance to the reliability requirement has been 
demonstrated. Several approaches are available. 


Nonconformance can mean: 
100% testing of product in lieu of sampling, or 
sampling is continued, but corrective action and/or suspen- 
sion of shipments is required. 


Representative of the first approach are AGREE production re- 
liability testing procedures. There is no doubt that the 100% 
test following nonconformance provides the supplier with con- 
siderable corrective action incentive in order to preclude exten- 
sive schedule slippage and increased test costs. However, where 
reliability is at an unacceptable level the necessary corrective 
action, more often than not, requires finding better parts, rede- 
Signing circuits, or improving manufacturing processes. This 
takes time, and 100% testing contributes little to the solution of 
known problems. 


The alternative (continued sampling but concurrent corrective 
action) also has pros and cons. Although providing less incen- 
tive, a supplier would be more inclined to accept a higher (but 
achievable) production mean time between failure on a correc- 
tive action basis in lieu of a lower reliability level when produc- 
tion schedules are involved. In electronics the state of art is 
such that it is better to shoot for a goal that is just out of reach, 
rather than fix a level that is achievable but inadequate. Relia- 
bility testing on a fixed test time basis also has the economic 
advantage of precluding investment in standby test equipment 
and manpower necessary to offset possible 100% test conse- 
quences. 


9.0 Preparation for Delivery 





Contains a brief standard statement covering costs, disposition 
and repair of failed units of product which have been reliability 
tested. 


Conclusions and Results 





The more important conclusions and results were considered to 
be as follows: 


With system reliability requirements and tests becoming stand- 
ard contractual practice, it became important that rapid, consistent, 
economic means be developed for establishing reliability control for 
subsystems and black boxes. 
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Reliability Testing Procedures and Tables were subsequently 
derived comprising detailed instructions and standardized tables from 
which time sampling plans were selected and incorporated into reliabil- 
ity testing specifications. 


Since major supplier black boxes comprise almost 40% of newer 
systems, application effort has been concentrated in this area, with full 
recognition that supplier reliability testing must complement, not re- 
place, timely component and system evaluations at other stages of de- 
velopment and production. 


Use of a standardized military type format permitted economic 
development and rapid initiation of quantitative reliability test specifica- 
tions. 


To date reliability testing and related systematic gathering of 
data has proved to be an effective method of monitoring the results of 
improvement effort. 


Regardless of increased application of design reliability pro- 
cedures, reliability testing will still be mandatory for purposes of (1) 
detecting and correcting unforeseeable part, unit, or system modes of 
failure, (2) verifying and revising parts failure rates used to estimate 
reliability from the drawing board, (3) determining hardware conform- 
ance to specified reliability levels, and (4) monitoring reliability trends 
and evaluating the results of changes. Reliability is in the process of 
joining the established requirements for system design such as power 
required, output, sensitivity or weight. In the future, it will be con- 
trolled on a quantitative basis. 


Technical Appendix 





1.0 Definitions and Assumptions 





1.1 A discrepancy is any malfunction which occurs during 
the course of reliability testing when a system, unit, 
part, ceases to perform satisfactorily. A discrepancy 
is considered to occur at a discrete point in time which 
is the instant of observation of the malfunction. For 
purposes of reliability testing discrepancies have been 
subdivided into failure or fault categories. 


1.2 A failure shall be considered as 
3.d.2 Any out-of-adjustment condition other than those 
which are normally readjusted by a pilot-oper- 


ator in flight. 


1.2.2 Any part failure which is not caused by test 
equipment malfunction, procedural error, or by 
failure of another part. 
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1. 


l. 


1.2.3 Any malfunction whose cause is unknown. 
A fault shall be considered as 


3. 8 Malfunction or damage due to the test equip- 
ment, incorrect procedures, or mishandling. 


SF Out-of-adjustment conditions which are normal- 
ly pilot-operator adjusted in flight. 


1.3.3 Secondary failures resulting directly from the 
failure of another component. 


1.3.4 Manufacturing error, such as use of the incor- 
rect part number, that can be readily corrected 
in all work in process and which will not recur 
in service use. 


Operating time comprises the total time one or more 
sampled units of product are operated in accordance 
with the applicable reliability test specification. 





The reliability index is expressed in terms of failures 
per thousand operating hours. 








Failures per thousand operating hours (f) expressed as 





an equation is 


¢ = F_x 1000 (1) 
T 


where F equals total failures per total operating hours T. 
Failures per thousand operating hours divided into 1000 
is the equivalent of mean time between failure (MTBF) or 
mean life (ML). Expressed as an equation, mean time 
between failure (MTBF) or mean life (ML) is 


1000 
f 


MTBF = ML = (2) 


oT 


where f, T and F are as defined above. 


Producer risk is the probability that product at the ac- 





ceptable reliability level (ARL), when reliability tested 
will be considered nonconforming, due to sampling fluc- 
tuations. 


Consumer 'risk is the probability that product at a spec- 
ific reliability level poorer than the ARL when reliability 
tested will be considered conforming due to sampling 
fluctuations. 
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(1) 


(2) 


(3) 


(4) 


3.9 Operating ratio is the ratio of the Acceptable Reliability 
Level in terms of mean time between failure (or failure 
rate) to a poorer reliability level at a specific consumer 
risk. 





Mathematical Basis 





2.1 The time sampling plans derived are predicated ona 
failure distribution per operating time in a manner de- 
scribable by the Poisson process, i.e., an essentially 
constant failure rate. 


Related Formulas 





x Oe Based on the definitions and assumptions of paragraphs 
1.0 and 2.1 the applicable Poisson distribution may be 
expressed as follows: 

mat.)Ce 7 (T/ML) 
p. = (T/ML) : (3) 
c! 





where Pc is the probability of c failures occurring per 
total operating time T, ML is the mean life or average 
operating tirme between failures, and e is the exponential 
constant (2. 7183). 


When deriving the probability of zero (0) failures during 
total operating hours T, formula (3) for c = 0 reduces to 


Po = e -(T/ML) (4) 
usually known as the exponential distribution. 
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RELIABILITY - A ROLE FOR QUALITY 
A. L. Lambert 
The Martin Company 
Baltimore, Maryland 


Ie Introduction 





A critical attitude, a high degree of curiosity and an abi- 
lity to evaluate objectively - these characteristics are valuable 
tools to those who work in Quality. These tools, useful in every- 
day work, should be helpful in determining the responsibilities of 
Quality in dealing with reliability. 


The need for curious, critical examination and objective 
evaluation stems, in part, from the uncertainty that cloaks the 
reliability field. The nature of the uncertainties are well known. 
It is not infrequent that questions are raised with respect to the 
importance of reliability, the amount of money that should be spent 
for it, how it should be specified and how it should be demonstrat- 
ed. 


In the face of questions such as these, it might well be con- 
cluded that the best Quality approach would be to wait until the 
fundamentals are firmly established. In opposition to this are 
the need for Quality to make active contributions to the advance- 
ment of reliability, and the fact that reliability is now being 
specified contractually. Some recent military proposal require- 
ments have included not only quantitative reliability specifica- 
tions, but also requests for detailed reliability plans with 
corresponding work schedules and manpower allocations. 


The responsibilities of Quality and its role in reliability 
have become matters of immediate significance. 


II. A Matter of Viewpoint 





The first step in a critical look at reliability is to decide 
what it is. One way of doing this is to examine the generally 
accepted definition that the reliability of an equipment is the 
probability that it will perform a pega rep ee under specified 
conditions for a required period of time. 1 The definition is 
fundamentally important for it permits the numerical specification 
of reiiability as a probability under defined conditions of per- 
formance, use and time. 


If we are curious and ask what this definition means, we find, 
not unexpectedly, that it means different things when considered 
from different viewpoints. To the military it can mean a known 
probability of mission success or a maintenance level factor. To 
the designer a numerical reliability specification represents a 
new design parameter. To Quality the reliability number as a de- 
Sign parameter becomes a quality characteristic to be measured, 
controlled and assured. 


This matter of viewpoint would seem to explain some of the 


difficulties of understanding and communication that are common 
in reliability. Because reliability is a relatively new field, it 
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is easy to seize upon a single viewpoint and exclude examination 
of the others; or to gain the impression that reliability is a 
hopeless confusion of not too meaningful terms. Actually, of 
course, there is definite meaning from each viewpoint and there 
need not be inconsistency in saying that reliability is a proba- 
bility of mission success - or a maintenance factor - or a design 
parameter - or a quality characteristic - or all of these; and 
that reliability is a statistical probability under three defined 
conditions. 


Reliability - Another Grey Area? 





From the Quality viewpoint it might be adequate to say that 
reliability as a design parameter is a quality characteristic. 
It may be helpful, however, to examine the meaning of this 
quality characteristic. 


There is a tendency to categorize reliability with what might 
be called "grey area" product quality characteristics such as 
producibility and maintainability (Figure 1(a)), as distin- 
guished from characteristics such as performance, weight, 
service life and selling price which traditionally have re- 
ceived greater emphasis. This approach would result in the 
delivery of a product with field characteristics as shown in 
Figure 1(b). It is interesting to note that the "grey area" 
characteristics generally are those which are not measurable. 
Notice that the "grey area" approach to reliability is unsat- 
isfactory is spelled out by the contractual specification of 
measurable, numerical reliability requirements. 


Reliability - A Figure of Operational Quality 





The demands of the reliability concept are not satisfied by 
the "grey area" approach; but a moments reflection indicates 
that even removal of the "grey" from the reliability block of 
Figure 2 does noi result in an adequate description of the 
characteristic. 


Reliability as a quality characteristic is more accurately 
portrayed as a numerical Figure of Operational Quality which 
encompasses the field characteristics of the product. This 
concept (or viewpoint) is indicated by Figure 2 which shows, 
only in part, the field characteristics of the product. 


To consider reliability as a figure of operational quality 
is, of course, a matter of relationship and does not remove the 
validity of the reliability definition nor the need to treat re- 
liability as a design parameter. It is intended to indicate the 
proper importance of reliability to Quality and to suggest the 
scope of work with which Quality is concerned if reliability is to 
be measured, controlled and assured as a quality characteristic. 


III. Elements of the Task 





To consider reliability as a figure of operational quality 
has advantages in terms of perspective. The questions of the re- 
sponsibilities of Quality and the role it should play in a 
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reliability program require, however, an examination of the work 
to be done. It is then reasonable to consider Quality functions 
in terms of relationship to the functions of Engineering, Manu- 
facturing, Procurement and other segments of organization. 


Reliability requires treatment through all phases of product 
life from the specification of requirements in the request for 
bid, through the proposal phase, design, development, production 
and into service usage. Perhaps the principles of the role of 
Quality can best be demonstrated by examination of the elements of 
work in the early phases where Quality activity is usually at a 
minimum. 


Areas for Quality Emphasis 





1. Proposal Phase 





The reliability task in the proposal phase is shown in brief 
form in Figure 3. The Request for Bid should reasonably be 
expected to contain the operational requirements, including 
the factors of usage, maintenance, and logistics in addition 
to the numerical reliability requirements. Elements of the 
reliability task include reliability studies, an estimate of 
environment, estimates of reliability, preparation of a re- 
liability program and an estimate of reliability costs. 


What are the functional responsibilities of Quality in a 
proposal phase that contains these tasks? The answer, in 
practice, will depend on the functional part that Quality 
plays in the particular organization. If, for example, the 
Quality function is generally limited to examination of the 
physical product, then the Quality task in proposal prepara- 
tion may be confined to the submission of manpower estimates 
to satisfy the requirements of the reliability program. On 
the other hand, when the Quality function is broad, Quality 
can play an active part in the proposal preparation phase of 
the reliability program. 


If it is the functional responsibility of Quality to assure 
that control is maintained and to assure the reliability of 
the product, then the proposal phase of reliability becomes 
critical in the discharge of this responsibility. This stems 
from the basic consideration that the reliability proposal 
not only permits but also limits, in terms of dollars, the 
effort to be expended in later phases of the program. It is 
not only reasonable, but also necessary in terms of Quality 
effectiveness, that the proposed reliability program contain 
the necessary elements and be adequate to translate the 
numerical reliability requirement into a proven design 
characteristic and into a demonstrated quality characteristic 
of the end product. 


Another example of a factor important in this phase is the 
specification itself. If vagueness exists in the reliability 
specification or in the specified means of demonstration - 
both common types of problems in normal quality work - then 


the proposal phase offers the most effective time for clari- 
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fication and the elimination of these potentially serious 
quality problems. 


2. Design and Development Phase 





Another area where the influence of Quality is historically 
low, but where reliability demands attention, is the product 
design and development phase. If we examine the elements of 
work in a reliability program in this phase we find a situa- 
tion which is basically outlined by Figure 4. The program 
consists of: (1) an organizational capability, (2) an ade- 
quate definition of the task, (3) a planned series of design 
and development steps calculated to satisfy the specified 
reliability requirements, (4) controls designed to evaluate 
progress and assure the achievement of objectives, and (5) an 
adequate demonstration of successful attainment of reliabi- 
lity objectives. 


Here, again, the functions assigned to Quality will vary from 
organization to organization. The traditional tasks such as 
receiving inspection and manufacturing in-process control re- 
main important and necessary even in the production of the 
prototypes and test models associated with this phase. The 
effectiveness of Quality in assuring the required operational 
quality (i.e., reliability) in the end product is greatly en- 
hanced, however, by a more extensive Quality participation in 
the design and development phase. It is not intended to 
imply that design responsibility be removed from the engineer 
or to suggest that Engineering not have primary responsibi- 
lity for control of design activities. This primary respon- 
sibility, however, does not remove the responsibility of 
Quality to assure that controls are adequate to satisfy the 
end requirements of the product. 


With reference to Figure 4, it is reasonable, for example, 
that Quality determine that the organizational capability 
exists for execution of the reliability program, that the 
definition of the task is adequate, that the reliability 
plans and controls contain the necessary elements, and that 
adequate provision is made for demonstration. Through the 
reporting of defects at the time of demonstration, for 
example, it is very possible to determine that an element 
such as Human Factors was omitted or inadequately specified 
in the initial definition of the task. From the quality as- 
surance standpoint it is more effective and of far more bene- 
fit to the program if such a discrepancy is discovered sever- 
al years earlier by an examination of the task definition 
during its formation. Similar comments are applicable to the 
essential steps in the reliability program and to the reliabi- 
lity controls developed for the design and development phase. 


In the control and assurance of reliability as a quality 
characteristic, Quality faces this basic situation. The time of 
reliability demonstration is too late (probably by several years) 
for effective quality control and quality assurance efforts. The 
reliability level achieved in the design cannot be improved 
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significantly by traditional Quality efforts in the manufacturing 
phase. Last, and by far the most important, is the conclusion 
that Quality can make its most effective contribution by active 
participation in the reliability program from the time of incep- 
tion in the proposal preparation phase. 


IV. Application of the Principles 





In a discussion of principles it is normally desirable to in- 
clude a concrete example of an application. For this purpose, the 
steps which have been taken by Martin, Baltimore are offered for 
review. 


The management organization for reliability at Martin, Balti- 
more is as shown in Figure 5, with the responsibility for coordi- 
nation placed within the Quality Division. It should be noted 
that reliability is recognized as presenting tasks to not only 
Engineering and Quality, but also to Manufacturing, Procurement 
and Customer Service. It is pertinent to note that in addition 
to coordination, Quality has the defined functions; (1) to estab- 
lish, in conjunction with the various activities, the responsibi- 
lity for performance of reliability functions, (2) to participate 
in project reliability planning, (3) to aid in the development of 
techniques for the analysis, prediction, measurement and attain- 
ment of reliability, (4) to perform independent analyses of custo- 
mer reliability requirements, and (5) to evaluate project reliabi- 
lity plans, efforts and results, and provide effectiveness reports 
to management for control purposes. 


The organizational concept is less than a year old and full 
implementation is progressing but is not yet complete. The steps 
which have been taken within Quality are those of interest in 
discussing the role of Quality and are perhaps best described by 
the functional organization chart of the Quality and Reliability 
Assurance Department which was formed about six months ago. The 
department head reports to the Director of Quality and performs 
the reliability coordination function described in Figure 5. The 
shaded areas of Figure 6 indicate the departmental activities 
which make a direct quality contribution to the Company's relia- 
bility effort. 


The Quality and Reliability Analysis Section serves as the 
nerve center in the corrective action feedback loop of the Com= 
pany's product failure and trouble analysis system. The section 
collects both field and in-plant information (including informa-. 
tion from design and development projects). Beyond the statisti- 
cal analysis of data, the section performs engineering investiga- 
tion and evaluation of troubles and failures to select significant 
problems. Corrective action feedback is provided and is followed 
by participation of the section in further investigation and the 
resolution of problems. Another function of the section is that 
of preparing periodic management reports on product trends and 
corrective action status. 


The quality audit function in the Quality Engineering 
Section includes the audit and evaluation of reliability programs. 


As a matter of interest, the basic function calls for audits of 
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those factors which affect end product quality. This encompasses 
not only reliability programs and the quality control system, but 
also functions within Procurement, Manufacturing and Engineering. 


The Reliability Engineering activity includes that of assign- 
ment of reliability engineers to advance design activities where 
reliability proposals are prepared. As shown in Figure 6, relia- 
bility engineers are assigned to projects having reliability pro- 
grams in later phases; these engineers participate in the project 
reliability planning and control efforts. 


In this review it is not intended to create the impression 
that reliability functions are solely a Quality responsibility. 
This is not the case. It should be noted, in this respect, that 
the Reliability Engineers assigned to projects are administrative- 
ly responsible to the projects and functionally responsible to 
Quality. The prime responsibility for leadership, direction and 
control of project reliability efforts rests with the Project 
Engineering Department of the Engineering Division. 


The Martin, Baltimore organization for reliability and the 
selection of Quality efforts are the result of careful study and 
evaluation of the reliability problem. Experience to date with 
this approach is felt to be good and is felt to demonstrate the 
soundness of the approach. 


Ve. Conclusions 


The role of Quality in the field of reliability may be con- 
troversial for some time to come. Nevertheless, the requirement 
that reliability be controlled and assured is now a matter of con- 
tractual specification. If it is logical that reliability as a 
design parameter becomes a quality characteristic, then the Quality 
problem of adequate treatment of this characteristic is real and 
immediate. 


The role of Quality in the treatment of the reliability 
characteristic may be expected to vary from organization to organ- 
ization. It is suggested, however, that the most effective role 
for Quality requires not only the application of traditional ef- 
forts but also the acquisition of new skills and the active parti- 
cipation of Quality in the early phases of product inception and 
planning. 
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SOLVING LINEAR PROGRAMMING PROBLEMS WITH CONTROL CHARTS 
By 


J. L. Dolby 
General Electric Company 


In the past several years there has been a good deal of discussion 
about linear programming as a handy tool for solving induétrial problems. 
On the other hand, there has been nearly as much discussion in the direc- 
tion of: What is it? How do you use it? Who has used it successfully? 
Outside the general area of what we might call "economic" problems, the 
number of successful applications in industry has been rather limited. 

At least part of this limitation has been caused by the rather formidable 
mathematics involved in deriving the various solutions presently avail- 
able. In this discussion we will present a somewhat simpler procedure 
that is probably relatively inefficient in the mathematical sense, but is 
much easier to understand and to explain to someone else. 


First we must define the general problem. Let's take a hypothet- 
ical example. Suppose we have a chemical process as in Figure 1, where 
we wish to determine the proper volume of the vessel, the operating 
pressure, and Monomer to catalyst ratio. In so doing, we wish to mini- 
mize the impurity content in PPM. We have available an equation that 
relates these in the following way: 


Impurity content (PPM) 
= 100 - 16 (volume) - 4 (pressure) 
+ 12 (ratio) 





ee Ete RESTRICTIONS : 
2 < volume < 12 liters 
Polymer | 
+ — Monomer 2 < pressure < 7 atmospheres 
Impurities a 3 < Monomer < 10 
Rete Nr Catalyst 
Catalyst 








(Pressure) (Volume) < 15 


Figure l 


It is the introduction of these restrictions that makes programming a 
useful tool. There have been mathematical tools available for some time 
that would enable us to find the maximum of many functions as long as 
there were no restrictions involved. However, in realistic industrial 
situations we must always face up to the fact that length cannot exceed 
12 inches, or the weight cannot exceed 10 pounds, or the assembly must be 
capable of operating efficiently at 120° F., etc. 


595 








Linear programming then appeals to everyone in its initial promise, 
precisely because it enables us to tackle realistic problems. It is, by 
definition, restricted to mathematical expressions that are linear, and 
this would probably explain why it is more useful in economic circum- 
stances than in any other area, since it is possible to express many ec- 
onomic situations by linear equations, 


The drag on the further application of linear programming has been 
the difficulty in understanding it. Most papers are either highly tech- 
nical or highly mechanical, i.e., they discuss the problem in the con- 
text of the theory of convex sets, or they provide a set of “cook-book" 
steps without detailing the reasons for making these operations. Unfor- 
tunately this is inherent in the nature of the procedure. It is a very 
neat mathematical method, but its derivation is complicated cand it is 
extremely difficult to give an intuitive description of it. However, if 
we are only interested in obtaining the minimum cost of a particular op- 
eration we can always use the trial and error procedure. In particular, 
we can make random trials, evaluate the cost of each trial, and select 
that combination that provides the lowest cost. 


Going back to our example, suppose the volume must lie between 2 
and 12 liters, the pressure between 2 and 7 atmospheres, and the catalyst 
ratio between 3 and 10. Furthermore, to meet temperature restrictions, 
the product of pressure and volume must not exceed 15. We could then 
select random numbers between 2 and 12 for the volume, 2 and 7 for the 
pressure, and 3 and 10 for the ratio, and calculate the impurity content, 
We would, of course, throw out any solutions that exceeded the tempera- 
ture restriction. Twenty-five trials are listed below: 





TABLE I 
Se Impurity 
Trials Volume Pressure Ratio Content 
1 3 4 7 120 
> 2 3 10 176 
3 4 3 6 96 
4 3 5 10 152 
5 4 2 4 76 
6 2 4 6 124 
7 3 4 5 96 
8 5 2 7 96 
9 2 5 5 108 
10 4 3 4 72 
11 3 2 & 140 
12 3 2 3 80 
13 4 2 5 FT) 
14 6 2 7 80 
15 2 5 . 144 
16 4 3 7 108 
17 3 2 9 152 
18 3 2 4 92 
19 4 2 7 112 


g 


TABLE I (CONT'D) 





Impurity 
Trials Volume Pressure Ratio Content 
20 6 2 5 56 
21 2 4 10 172 
22 5 3 8 104 
23 3 2 4 92 
24 3 5 8 128 
25 7 2 9 88 


Of course, this would not lead to the exact answer unless we took 
an infinite number of trials. However, as quality control people we do 
not normally require an exact estimate of the quality of a lot of ma- 
terial. We are perfectly willing to sample the lot because we know 
that beyond a certain point the information gained by further sampling 
does not equal the cost of further work. Even though this is a purely 
mathematical problem the same principle holds here. If we were doing 
this problem by hand, we would continue only until we were satisfied 
that further work would not sufficiently improve our answer to justify 
the cost. 


As a means of determining when we should quit we can plot the 
results on a control chart, ignoring for the moment, the particular com- 
bination of length, width, and weight that we choose, and record only 
the impurity level. Figure 2 shows such a plot of the 25 trials listed 
above, with 3o control limits. 





= Vy 71 


o | 








Figure 2 
In this case, as with most control charts, we are going to be particular- 


ly interested in points that lie beyond the control limits. Since only 
two points fall below the limits in the first sample of 25, we took 
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another 50 trials to obtain a better idea of what conditions would lead 
to points on the low side. At the end of these 75 trials we had 8 points 
out of control low: 


TABLE II 
Impurity 
Volume Pressure Ratio Content 
4 3 4 72 
6 2 5 56 
. 2 3 48 
6 2 3 32 
7 2 5 40 
6 2 3 32 
4 2 3 64 
7 2 5 40 


Qn examination it turns out that the volume of the vessel is always 
between 4 and 7 liters, the pressure is between 2 and 3 atmospheres, and 
the catalyst ratio between 3 and 5. 


Since in this case we want the low values of impurity content 
we want to generate some more points that will fall in this low region 
to give a better idea of low impurity performance requirements. We then 
calculated 20 or more results using the new limits defined by the low 
points of the initial runs. These results are summarized in Figure 3. 
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Figure 3 


We see immediately that virtually all these trials are below the low 
limit of the previous control chart. Furthermore, we find that the first 
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of these trials leads to an impurity content of only 16 PPM. 


We could, of course, continue this analysis another step, but this 
should be enough for demonstration purposes. We can, however, get 
further useful information from this approach. Clearly, the volume of 
the vessel used in this hypothetical process would be a constant. How- 
ever, we can expect that the pressure and the catalyst ratio will vary 
in actual operation. We could then set the volume constant (at 7 atmos- 
pheres since that was the value for the best run) and choose random 
values of the pressure and catalyst ratio as close to the optimum as 
is feasible. For this particular case we might say that pressure can 
vary between 2.0 and 2.1 and the ratio can vary from 3.0 to 4.0. Twenty 
five such runs are given in Table III. We can then expect that 
impurities will range between 15 and 26 PPM. A definitive statement 
would depend upon assumptions about the distribution. (We have used 
uniform distributions in all these examples.) 


TABLE III 
Impurity 


Pressure Ratio Content 


i) 
o 


25.6 
22.8 
17.2 
24.0 
25.2 
25.6 
17.2 
23.2 
16.8 
22.0 
18.4 
15.6 
19.2 
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In summary this particular procedure is quite simple to understand 
and to use. For very small problems its mathematical efficiency is 
poor, but even this difficulty is wiped out as the problems become more 
complex. Furthermore this procedure is not limited to linear problems, 
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nor does it require stringent mathematical assumptions, These basic 
advantages should be sufficient to establish this as a working tool in 
industry. 





RESEARCH IN THE ORBIT OF QUALITY CONTROL 


Paul E, Allen 
Beech Aircraft Corporation 


Research in the orbit of Quality Control has been going on for 
many years by experts far greater than 1; however, the results of these 
efforts published in voluminous forms have left me confused as to their 
objectives and even greater still as to just what members of our pro- 
fession think Quality really is -- is Quality making a part precisely 
match a drawing, or is Quality making a part that will fulfill its 
mission and provide customer satisfaction? I repeat -- is Quality 
making a part precisely match a drawing, or is Quality making a part 
that will fulfill its mission and provide customer satisfaction? 


With a firm understanding of Beech Corporate Management philosophy 
of product quality, my own personal curiosity, and the cooperation of 
hundreds of you in Industry, I have tried to connect into an integrated 
program the best results of research in many of the apparent tangential 
concepts orbiting around the quality axis. 


For the purpose of reader interest, I will start with our Corporate 
Management philosophies, our facilities, our products and our customers 
to give you a better understanding of why the efforts outlined in this 
paper actually have taken, or are taking, place. 


Beech Corporate Officials have gone on written record to all 
BEECHCRAFT employees expressing their views of product quality, which 
briefly are: 


PRESIDENT AND CHAIRMAN OF THE BOARD 


a 
‘ » Quality, to me, is that invaluable characteristic 
Ww ~— = that makes one brand better than another. A 
o quality product is dependable . . . outstanding 
in appearance and performance .. . and com- 

Peat petitive in price. 


President 


VICE PRESIDENT - GENERAL MANAGER AND DIRECTOR 


QUALITY IS OUR MOST IMPORTANT PRODUCT 


Vice President-General Manager 
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VICE PRESIDENT - COORDINATOR AND DIRECTOR 


An Upward Sales Trend is Possible Only Through 
SATISFIED CUSTOMERS 


Vice were ee 


wages te 





You can see from the above that we in Beech Quality Control have 
our objectives pretty well defined by our Corporate Management. 


It was Walter Beech's wish that Beech have a conscience -- that 
conscience would be responsible for assuring that BEECHCRAFT products 
were of the best comparative quality and were able to fulfill their 
intended mission. In fulfilling Walter Beech's wish, Beech Management 
combined Quality Control and Customer Service into a single department 
and gave me the assignment of Department Manager. The reasoning behind 
this decision was to facilitate the implementation of an adequate data 
feedback system to assure a maximum degree of product evaluation which 
is necessary to properly control the product quality. 


The Beech facilities are located in several factories, offices and 
test units in Wichita, Herington and Liberal, Kansas, and Boulder and 
Longmont, Colorado. 


The Beech products consist of: 


I. Military -- 








1700-Gallon Training Aids 
Tank 





II. Commercial -- 








P— ah 
== 7 I 
Bonanza Travel Air Twin-Bonanza Super 18 
III. Subcontract -- 
4 4a Sits Po 
=> E22 Ss 
_— a2 oe 
Wings Aft Fuselages Assemblies Fuel Tanks 


The Beech customer's list contains many Governments and elite of 
the free world. 


Our products must satisfy the requirements of all applicable 
Government Agencies including CAA, Air Force, Army, Navy, Atomic Energy 
Commission, several prime weapons systems contractors, and about 800 
annual commercial, domestic and export purchasers; therefore, our prod- 
uct control system must be versatile -- effective, yet not cumbersome. 


Now with this background history, you can see that our Quality job 
is clearly defined and simple -- provide outstanding quality products 
that will fulfill their intended mission and provide customer satis- 
faction. 





Up to this point, you can see that everything is clear-cut with no 
need for exploration or research -- now we explore the implementation 
of a plan to provide the results required by Management. All we have 
to do is: 


(1) Provide top product quality. 
(2) Provide product ability to fulfill its intended mission. 
(3) Provide customer satisfaction. 


(I am sure you will all agree that you do not have a true quality 
product unless your product fulfills the above three basic prerequi- 
sites.) 


One would normally assume that the task of implementing a product 
control program to fulfill three simple requirements, such as those 
listed above, could hardly necessitate any research effort -- these are 
pure cut-and-dried age cld traditional philosophies and habits -- or 
are they? 


When we find Manufacturing, Inspection, Procurement, Engineering, 
Service and Sales discussing a quality item under induced warm climatic 








conditions sometimes augmented by hypertensive physical conditions, I 
must reconcile myself into asking, "What is Quality? Where is it de- 
fined?" The only fair way in my opinion to arrive at the answer to this 
basic question is to ask expert, novice, customer, designer, seller, 
builder, inspector and service men, "What in their opinion Quality 
really is", and arrive at a composite opinion. 


Yes, this is the logical, simple approach to the problem and, once 
we have carried it out, we have eliminated the problem once and for all. 


We make a questionnaire and ask 50 people the question, “What in 
your opinion is the job of Quality Control?” -- just as quick as we get 
50 answers, we find that the job of Quality Control is to inspect parts 
to assure conformity to the engineering data. 


Is this our cure-all answer? Well, the Customer Service Department 
says a large portion of our customer dissatisfaction comes from design 
items -- the customer says the Quality is poor, because it won't work. 


Hmmm -- Inspection says it's like the drawing -- yet it won't 
fulfill its mission and the customer is unhappy. So our first research 
answer definition of Quality Control's function is inadequate as it does 
not appear to fulfill 2 of our 3 basic requirements. 


Perhaps the 3 basic requirements are wrong -- 30 we ask over 100 
people from 15 industries and Government Agencies: 


(1) When you buy an article such as a car, boat, motor, radio, 
television set, etc., what do you expect from that article 
in the way of Quality? 

(a) Good workmanship? 
(b) Conformance to design? 
(c) Ability to run and do its intended job? 

(2) How would you define Quality in one sentence? 

(3) What in your opinion is Quality Control? 


The answers to this inquiry, which are shown on Figure 1, average 
out as follows: 


(a) 66% of the people considered ability of a product to do 
its intended job was the most important quality factor. 


(b) 23% of the people considered good workmanship was the 
most important quality factor. 


(c) 11% of the people considered conformance to design the 
most important quality factor. 


Quality Control was generally defined as being: The technique, 
program or policy used in attaining and assuring the quality level. 


Quality was generally defined as being: The measure of the product 


or service relative to its ability to perform and provide customer 
satisfaction against some standard of performance of perfection. 
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Well, the recap of this study would appear to indicate it is gen- 
eral opinion that a quality product should be able to fulfill its mis- 
sion, have good appearance and supply adequate customer satisfaction. 
So our first 3 basic requirements for Quality Control still appear to 
be applicable. 


Maybe Customer Service is wrong and design isn't a major factor in 
product quality -- so we ask Management or Command personnel serving in 
Engineering, Quality Control, Manufacturing, Testing and Flight Safety 
Groups in the aircraft, missile, engine and equipment industry, and the 
Air Force, Army, Navy, Airlines and the CAA, the following questions: 


What percent of a product's service problems would you estimate are 
due to: 


(a) Design? 

(b) Nonconformance to design? 

(c) Vendor component not suited to a system? 
(d) Misuse of the product by the customer? 
(e) Other? 


The results of this research questionnaire which are shown in 
Figure 2 reveal that, in the opinion of those polled, the customers’ 
service problems are: 


(a) 38.6% due to design. 
(b) 25.8% due to misuse of the product. 
*(c) 12.6% due to vendor component not suited to the system 
application. 
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(d) 11.2% due to nonconformance to the design. 
(e) 11.7% due to other causes. 


*In most cases, these were specified by design. 





















































PRODUCT CONTROL AND RELIABILITY 
QUESTIONN:IRE SUMMARY SHEET AVERAGE 
CATEGORY OF WORK ENGINEERING | QUaLITY CONTROL | SERVICE ] MILITARY USER | GENERAL 
TWEROE 7AVER TVEROGE AVERAGE AVERAGE 
sq «| DESIGN 50% 4lg 2% 3% 38. 
=} 
NONCONFORMANCE TO , . 
E DEsion 11% 13% & 10% 11.2% 
ae 
&oO 
©] VENDOR COMPONENT NOT : 
Bs ee © ws 7% 12 1s 12.6% 
& 
MISUSZ OF PRODUCT BY ‘. ‘ 
Eg CUSTOMER 2b 20% 33% 30% 25.0% 
og 
i] S 
a & OTHER % *” 20% 15% 11.7% 
Figure 2. 


(It is interesting to note that Engineers felt 50% of the problems 
were due to design while the mixed average was only 38.6%, and the 
Inspection personnel felt 13% of the problems were due to nonconformance 
to design while the mixed average was 11.2%. The military users felt 
that 30% of the problems were due to misuse while the mixed average was 
25.8% -- it appears that each Group imposed more self-criticism than the 
over-all average.) 


Now we have generally defined Quality and we have generally located 
where the areas are where Quality Control is needed -- how do we get it? 
This should be the easiest task of all as our Library has many books and 
periodicals pertaining to the subject, so all we need to do is read a 
few stories and we are in business managing a Quality Control operation. 


Well, at least it should be simple -- or shouldn't it? We review 
many issues of our ASQC's periodical and find that about 76% of the 
articles pertain to the scientific expose of the mathematical applica- 
tion of numerical quantities of variance or lack of variance of a char- 
acteristic of parts from a specification or drawing. Our researcher -. 
failed to report any significant amount of text devoted to areas of 
Quality Control other than conformance to design wherein only 11.2% of 
the customer problems exist. 


Hmmm -- maybe the simple task of Quality Control implementation is 
just complex in the one area, nonconformance to design, where the 11.2% 
of the customers’ problems are, as most of the ASQC periodical effort 
is, in this area -- should we assume this to be true and go on. Well, 
we're skeptical, but we have to assume something, so we go along with 
the statistical theorist -- to a degree -- and find that product quality 
as pertaining to design conformance can be better guided by application 
of statistical Quality Control to the manufacturing process by charting 
the results of sample inspections. 


This makes good sound logic until our researcher -- a precise, 


persistent, persuasive personality comes in from the Library with a 
report from a well-known company with quotes from Universities, etc., 





which says that 100% inspection will only catch 68% of the defective 
parts and 400% inspection will only catch 98% of the defective parts. 


Well, with this revelation, you realize that it's a good thing 
that only 11.2% of the customers’ problems are due to nonconformance 
to design and proceed to put your Quality Control Research Engineer to 
work by ordering a research study to explore a few simple basic prin- 
ciples of product control, such as: 


(a) The best type of human being to inspect various categories 
of parts. 

(b) The cost of inspection vs. accuracy of inspection by 
different methods. 

(c) The hypnotic effect of repetitive inspection on different 
types of people. 

(d) The comparative inspection ability to catch the defects 
in a lot of parts when the same lot is checked by the 
inspection system in five different companies. 

(e) The ability of Engineers to check parts to their own 
designs. 


This program has just started; however, the preliminary results 
are indeed interesting, such as: 


(1) Although the pay rates of the Engineers selected for the 
tests averaged 2.2 times that of the Inspectors, their 
performance was 8.47% less effective. 

(2) The cost of equipment necessary to check the parts, as 
determined by two Inspectors, varied from $858.20 to 
$3,106.75. 

(3) The man with the lowest inspection equipment cost re- 
quirement was 98% effective, while the man with the 
highest equipment cost requirement was 98% effective 
in catching the defective parts. 


Another area of research is the vendor-furnished item failures and 
the causes for these failures. In reviewing the area of vendor items 
that fail to function in a system, we find several factors contributing 
to the problems, such as: 


(1) The test article sent in by the vendor is a top, rather 
than average, performer. 

(2) Some vendors seem to be more interested in getting the 
order than they are in the suitability of the component 
for the job. 

(3) Some suppliers appear to have little interest in their 


product or reputation as pertaining to quality or product 
reliability. 


The results of these preliminary research efforts indicate that 
the term Quality Control cannot be limited to the checking or inspec- 
tion of materials or parts for conformity to a design, but must assume 
a much broader scope if our intent is to provide products that have the 
functional quality required to fulfill the intended mission. 


Possibly we in ASQC should evaluate the distribution of effort by 
our Society so we publicize and discuss with proper proportions each 
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area of product control and not lean specifically to pure statistics. 


Possibly we should more fully evaluate human capabilities so as to 
validate the statistical assumptions. 


But most of all, let us not forget that our whole objective should 
be to assure customer satisfaction -- without customers for our products, 
our future would be very limited. May the Good Lord never let us forget 
that good customers are hard to get. 








QUALITY'S NEGLECTED CUSTOMER - THE ENGINEERING DEPARTMENT 


; Henry A. Schumer 
Chief Engineer, Asheville Plant 
International Resistance Company 


If the Quality Control Department is to make a significant con- 
tribution, it must recognize the Engineering Department as a customer 
and it must recognize the customer's needs just as it does in the usual 
customer - vendor relationship. Thus, the Quality Department must 
evaluate its relationship with Engineering by asking itself these 
questions: 


1. Is it providing a service which satisfys Engineering 
needs? 

2. Is this service more effective than others available? 

3. Is an acceptable product delivered, on time? 


Unless the answers to all of these questions are yes, the use of 
statistical quality control techniques may result in a less effective 
solution to a problem than if some other more orthodox (from the 
Engineering Department point of view) technique was used. 


It is not unusual for the Quality Department to be in a position of 
determining whether the product which it has to sell is consistent with 
the customer's needs. Its usual function is to evaluate the product of 
others to establish whether it is in truth satisfactory. However, the 
same concept should apply when dealing with Engineering problems, except 
in this case it is one of soul searching to determine whether the tech- 
niques which are available to the Quality Engineer are necessary for 
performing the job at hand, or whether there are other methodologies 
which would provide a cheaper, faster, but equally as efficient or 
effective answer. In many areas the most efficient technique may be 
nothing more than "practical engineering judgement"; whereas, the added 
sophistication and cost of a statistical treatment of the problem may 
be the most inefficient. It is, therefore, our obligation to recognize 
and assist the customer in deciding which techniques are applicable. In 
most instances, the measurement or success of a solution is determined 
by the cost savings, reliability, or conformance to specification limits. 
A careful evaluation of the consequences of a wrong decision must be 
made and unless it is clear that the time invested in the use of some 
subtle technique can be justified, advocating its use would be improper. 


Once we have established that the problem is one which is within 
the realm of statistical quality control, and by this we mean economi- 
cally and effectively within the realm of statistical techniques, it is 
important that we dispel much of the mystery involved in the use of this 
science. We, the practitioners of the art, must not only be able to 
understand the limitations of statistical techniques, but also be capa- 
ble of putting them in their proper perspective so that we do not fall 
into the trap of making a decision which, while technically sound, many 
times has little meaning in the business or operational sense. 


The third factor which must be considered is the delivering of an 
acceptable product on time, and it is here that many quality investi- 
gators are guilty of unclear thinking. The acceptable product which 
must be provided is a complete analysis of the data, so that the "line" 








function (the Engineering Department) can make the correct decision, and 
not a decision as to what future action is required. On the surface 
this may not seem to be a significant problem; however, unless this 
principle is closely adhered to, the relationship between Quality and 
Engineering will be fraught with misunderstandings and mistrust and as a 
result could very well negate the long term value of the Quality Depart- 
ment. As the Quality Control - Engineering relationship matures, sug- 
gestions and recommendations presented by Quality will not only be 
expected but will be sought. 


The maturing process requires the complete removal of suspicion re- 
garding methods or techniques as well as intent, and since suspicion of 
new things is a natural tendency among most people it is best handled by 
familiarization with the new technique or method. Wherever possible, 
the Engineering staff should be provided with "Quick and Dirty" tech- 
niques of a "Cook Book" type so that they may become familiar with the 
methods involved. A further extension of this technique would be in the 
form of a training agenda which might include the following topics: 


1. Fundamental statistical techniques as applied to 
Engineering problems. 

2. Introduction to experimental design. 

3- Review of previous Engineering decisions based on 
the use of these techniques. 


The company organization and position of the Quality Department is 
also a critical factor in determining the success of Quality Control 
techniques in the Engineering function. However, regardless of its area 
of responsibility, in the final analysis the success of the program will 
be directly related to the sales ability of the practitioner. In my 
experience I have been actually responsible for the failure of a program 
by forcing of issues because of company position and as a result alien- 
ated the persons involved so as to reduce the overall effectiveness. A 
sound foundation for this activity must be based on mutual trust, con- 
fidence, and understanding, and cannot be ordered in. Therefore, the 
program must be carefully planned and thoroughly sold beforehand. 


It is true that individual battles can be won and in some cases re- 
calcitrants must be shown the way; however, it is obvious that this 
method of conversion is to be avoided when possible. The best approach 
to sell this program is to convince the Engineer customer that he needs 
what you have to sell and that you can make his job easier or make him 
more efficient. In a short time this should lend to a clearer awareness 
of the general field of application and perhaps to requestefor applica- 
tion of more sophisticated techniques on Engineering data. Have no 
fears - Engineering personnel are too occupied with engineering problems 
to become expert statistical analysts (even Quality Engineers do not 
enjoy this luxury). 


Unfortunately, only the minor effects of Quality Control assistance 
to Engineering are recognizable in dollars and cents; a reduction in 
experimental time, the reduction in test samples, replication of experi- 
ments, etc. The really significant gains in increased efficiency of 
decisions and reduction of overall development time from conception to 
factory operation are not readily evaluated but are unquestionably the 
major factors. 
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Over the years IRC recognized the need for "a procedure to delegate 
responsibility for initiating, developing, and manufacturing of new 
products". In 1953 Mr. G. Entrekin, then Chief Product Engineer, and 
Mr. L. Jacobson, then Manager of Quality Control, developed the follow- 
ing procedure which has resulted in many of the benefits discussed above. 


Table I illustrates the mechanism of the procedure whose salient 
points are: 


1. Assignment of project by top management to a project 
leader (usually Research Division) whose overall re- 
sponsibilities are clearly defined. 

2. The review of the project by top management after 
preliminary evaluations have been made. 

3. If in the opinion of top management the market 
potential, product performance, and profit potential 
dictate continuance of the project, the project is 
turned over to Engineering for further development. 

4. After the Engineering development is complete, which 
includes design of processing equipment, all tool and 
product drawings, manufacturing.procedures, quality 
standards, cost standards, etc., a pilot operation is 
set up jointly by the operating plant personnel and 
the engineering group with all plant staff services 
participating in both the design of the pilot plant 
operation as well as evaluation of the results. 


Following this system materially reduces the time and money in- 
volved in the development of a new product. It also provides all groups 
and departments with an opportunity to review and comment on a product 
prior to its introduction to production with the concomitant confusion 
and delay which results from attempting to do engineering on the 
production floor. 
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